4.6 Article

Dual residual attention module network for single image super resolution

期刊

NEUROCOMPUTING
卷 364, 期 -, 页码 269-279

出版社

ELSEVIER
DOI: 10.1016/j.neucom.2019.06.078

关键词

Super-resolution; Dual residual attention module; Local information integration

资金

  1. National Natural Science Foundation of China [61871308, 61472304, 61772402, U1605252]
  2. Fundamental Research Funds for the Central Universities
  3. Innovation Fund of Xidian University

向作者/读者索取更多资源

Recent studies show that research on single image super-resolution (SISR) has achieved great success by using deep convolutional neural network (CNN). Different types of features obtained in deep CNN have different contribution. However, most of the previous models ignore the distinction between different features and deal with them in the same way, which affects the representational capacity of the models. On the other hand, receptive fields with different size capture diverse features from the input. Based on the above considerations, we propose a dual residual attention module (DRAM) network which concentrates on recovering the high-frequency details and sharing the information between two receptive fields of different sizes. We construct local information integration (LFI) module as the basic module to make full use of the local information. The LFI module is a cascade of several dual residual attention fusion (DRAF) blocks with a dense connection structure. The feature modulation can focus on important features and suppress unimportant ones. The evaluation results on five benchmark datasets demonstrate the superiority of our DRAM network against the state-of-the-art algorithms. (C) 2019 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Artificial Intelligence

Lightweight image super-resolution with feature enhancement residual network

Zheng Hui, Xinbo Gao, Xiumei Wang

NEUROCOMPUTING (2020)

Article Computer Science, Artificial Intelligence

Hierarchical convolutional neural network via hierarchical cluster validity based visual tree learning

Yu Zheng, Qiuyu Chen, Jianping Fan, Xinbo Gao

NEUROCOMPUTING (2020)

Article Computer Science, Information Systems

Progressive perception-oriented network for single image super-resolution

Zheng Hui, Jie Li, Xinbo Gao, Xiumei Wang

Summary: Deep neural networks have proven to significantly enhance single image super-resolution performance. While previous methods aiming at PSNR maximization often result in blurred images at high upscaling factors, the introduction of GANs has helped to mitigate this issue by generating impressive results with synthetic high-frequency textures. However, GAN-based approaches may introduce fake textures and artifacts to enhance the visual resolution of super-resolved images. This paper proposes a new perceptual image super-resolution method that progressively generates high-quality results by constructing a stage-wise network.

INFORMATION SCIENCES (2021)

Article Computer Science, Artificial Intelligence

Support structure representation learning for sequential data clustering

Xiumei Wang, Dingning Guo, Peitao Cheng

Summary: Sequential data clustering is a challenging task in data mining, and subspace clustering is a representative tool for dealing with complex local correlation and high-dimensional structure. It is important to learn a more specific structure representation of a sequence to preserve both sequential information and efficient connections.

PATTERN RECOGNITION (2022)

Article Environmental Sciences

Difference Curvature Multidimensional Network for Hyperspectral Image Super-Resolution

Chi Zhang, Mingjin Zhang, Yunsong Li, Xinbo Gao, Shi Qiu

Summary: The paper introduces a difference curvature multidimensional network for hyperspectral image super-resolution that leverages spectral correlation to enhance spatial resolution. This is achieved through a self-attention mechanism and bottleneck projection to reduce redundancy. Additionally, a difference curvature branch is designed as an edge indicator to preserve texture information and eliminate unwanted noise in high-dimensional space.

REMOTE SENSING (2021)

Article Environmental Sciences

Edge-Preserving Convolutional Generative Adversarial Networks for SAR-to-Optical Image Translation

Jie Guo, Chengyu He, Mingjin Zhang, Yunsong Li, Xinbo Gao, Bangyu Song

Summary: Synthetic aperture radar (SAR) remote sensing is important in modern Earth observation, but interpreting SAR images is challenging. This paper proposes an edge-preserving convolutional generative adversarial network (EPCGAN) to enhance SAR-to-optical image translation by leveraging edge information and implementing content-adaptive convolution. Experiments show that EPCGAN outperforms other methods in recovering structures and yielding superior evaluation results.

REMOTE SENSING (2021)

Article Engineering, Electrical & Electronic

Contrast-Based Unsupervised Hashing Learning With Multi-Hashcode

Xi Zhang, Xiumei Wang, Peitao Cheng

Summary: This article proposes a new strategy based on contrastive learning to improve the performance of unsupervised image retrieval. By fully utilizing the structural information in semantic similarity and employing a novel framework to handle hash codes with different lengths simultaneously, better image retrieval results are achieved.

IEEE SIGNAL PROCESSING LETTERS (2022)

Article Computer Science, Artificial Intelligence

Adaptive Modulation and Rectangular Convolutional Network for Stereo Image Super-Resolution

Xiumei Wang, Tianmeng Li, Zheng Hui, Peitao Cheng

Summary: This paper proposes a deep learning based stereo image super-resolution algorithm that utilizes additional information in stereo image pairs to enhance the quality of reconstructed images. The authors introduce an adaptive modulation alignment mechanism and the use of rectangular convolution kernel to address the challenges caused by occlusion. Experimental results demonstrate that the proposed method achieves state-of-the-art performance on multiple stereo benchmarks.

PATTERN RECOGNITION LETTERS (2022)

Article Environmental Sciences

MSSDet: Multi-Scale Ship-Detection Framework in Optical Remote-Sensing Images and New Benchmark

Weiming Chen, Bing Han, Zheng Yang, Xinbo Gao

Summary: Ships are crucial in ocean transportation, making ship detection an essential technology for marine safety. While optical remote-sensing images are valuable for ship detection, few open datasets exist due to sensitive data issues. The proposed MSSDet framework, utilizing a joint recursive feature pyramid, shows promising results in detecting multi-scale ship objects with improved generalizability and competitive performance compared to state-of-the-art methods.

REMOTE SENSING (2022)

Article Environmental Sciences

TMDiMP: Temporal Memory Guided Discriminative Tracker for UAV Object Tracking

Zheng Yang, Bing Han, Weiming Chen, Xinbo Gao

Summary: Unmanned aerial vehicles (UAVs) have attracted increasing attention in recent years due to their wide range of applications. Object tracking is a critical algorithm for UAVs, but it still faces challenges such as limited textures and contours of UAV objects, and the need to constantly move the camera with the object. In this paper, we propose an end-to-end discriminative tracker called TMDiMP, which incorporates a novel memory-aware attention mechanism to generate discriminative features and overcome the object-forgetting problem. We also introduce a UAV object-tracking dataset named VIPUOTB, which differs from existing datasets in terms of object size, camera motion, and location distribution. Experimental results demonstrate the effectiveness and robustness of our proposed algorithm.

REMOTE SENSING (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Not Just Selection, but Exploration: Online Class-Incremental Continual Learning via Dual View Consistency

Yanan Gu, Xu Yang, Kun Wei, Cheng Deng

Summary: The study proposed a novel and effective framework for online class-incremental continual learning, which not only considers sample selection but also fully explores semantic information in the data stream. By effectively combining gradients and mutual information, state-of-the-art performance was achieved.

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2022)

Article Computer Science, Artificial Intelligence

Seeking Subjectivity in Visual Emotion Distribution Learning

Jingyuan Yang, Jie Li, Leida Li, Xiumei Wang, Yuxuan Ding, Xinbo Gao

Summary: This study addresses the issue of subjectivity in visual emotion analysis and proposes a novel method to tackle this problem. By simulating the emotion evocation process and incorporating an attention mechanism, the proposed method is able to better predict people's emotions towards different visual stimuli.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Learning the Non-differentiable Optimization for Blind Super-Resolution

Zheng Hui, Jie Li, Xiumei Wang, Xinbo Gao

Summary: This study introduces an adaptive modulation network (AMNet) for blind super-resolution (SR) with multiple degradations and incorporates deep reinforcement learning into the entire blind SR model to address non-differentiable issues.

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Real-Time Video Super-Resolution on Smartphones with Deep Learning, Mobile AI 2021 Challenge: Report

Andrey Ignatov, Andres Romero, Heewon Kim, Radu Timofte, Chiu Man Ho, Zibo Meng, Kyoung Mu Lee, Yuxiang Chen, Yutong Wang, Zeyu Long, Chenhao Wang, Yifei Chen, Boshen Xu, Shuhang Gu, Lixin Duan, Wen Li, Wang Bofei, Zhang Diankai, Zheng Chengjian, Liu Shaoli, Gao Si, Zhang Xiaofeng, Lu Kaidi, Xu Tianyu, Zheng Hui, Xinbo Gao, Xiumei Wang, Jiaming Guo, Xueyi Zhou, Hao Jia, Youliang Yan

Summary: Video super-resolution has become increasingly important on mobile devices, but existing solutions are too resource-intensive, leading to the introduction of the first Mobile AI challenge. Participants used the REDS dataset and evaluated their models on the OPPO Find X2 smartphone to develop video super-resolution solutions that achieve real-time performance on any mobile GPU.

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021 (2021)

Article Computer Science, Artificial Intelligence

3D-KCPNet: Efficient 3DCNNs based on tensor mapping theory

Rui Lv, Dingheng Wang, Jiangbin Zheng, Zhao-Xu Yang

Summary: In this paper, the authors investigate tensor decomposition for neural network compression. They analyze the convergence and precision of tensor mapping theory, validate the rationality of tensor mapping and its superiority over traditional tensor approximation based on the Lottery Ticket Hypothesis. They propose an efficient method called 3D-KCPNet to compress 3D convolutional neural networks using the Kronecker canonical polyadic (KCP) tensor decomposition. Experimental results show that 3D-KCPNet achieves higher accuracy compared to the original baseline model and the corresponding tensor approximation model.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Personalized robotic control via constrained multi-objective reinforcement learning

Xiangkun He, Zhongxu Hu, Haohan Yang, Chen Lv

Summary: In this paper, a novel constrained multi-objective reinforcement learning algorithm is proposed for personalized end-to-end robotic control with continuous actions. The approach trains a single model using constraint design and a comprehensive index to achieve optimal policies based on user-specified preferences.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Overlapping community detection using expansion with contraction

Zhijian Zhuo, Bilian Chen, Shenbao Yu, Langcai Cao

Summary: In this paper, a novel method called Expansion with Contraction Method for Overlapping Community Detection (ECOCD) is proposed, which utilizes non-negative matrix factorization to obtain disjoint communities and applies expansion and contraction processes to adjust the degree of overlap. ECOCD is applicable to various networks with different properties and achieves high-quality overlapping community detection.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

High-compressed deepfake video detection with contrastive spatiotemporal distillation

Yizhe Zhu, Chunhui Zhang, Jialin Gao, Xin Sun, Zihan Rui, Xi Zhou

Summary: In this work, the authors propose a Contrastive Spatio-Temporal Distilling (CSTD) approach to improve the detection of high-compressed deepfake videos. The approach leverages spatial-frequency cues and temporal-contrastive alignment to fully exploit spatiotemporal inconsistency information.

NEUROCOMPUTING (2024)

Review Computer Science, Artificial Intelligence

A review of coverless steganography

Laijin Meng, Xinghao Jiang, Tanfeng Sun

Summary: This paper provides a review of coverless steganographic algorithms, including the development process, known contributions, and general issues in image and video algorithms. It also discusses the security of coverless steganography from theoretical analysis to actual investigation for the first time.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Confidence-based interactable neural-symbolic visual question answering

Yajie Bao, Tianwei Xing, Xun Chen

Summary: Visual question answering requires processing multi-modal information and effective reasoning. Neural-symbolic learning is a promising method, but current approaches lack uncertainty handling and can only provide a single answer. To address this, we propose a confidence based neural-symbolic approach that evaluates NN inferences and conducts reasoning based on confidence.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

A framework-based transformer and knowledge distillation for interior style classification

Anh H. Vo, Bao T. Nguyen

Summary: Interior style classification is an interesting problem with potential applications in both commercial and academic domains. This project proposes a method named ISC-DeIT, which combines data-efficient image transformer architectures and knowledge distillation, to address the interior style classification problem. Experimental results demonstrate a significant improvement in predictive accuracy compared to other state-of-the-art methods.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Improving robustness for vision transformer with a simple dynamic scanning augmentation

Shashank Kotyan, Danilo Vasconcellos Vargas

Summary: This article introduces a novel augmentation technique called Dynamic Scanning Augmentation to improve the accuracy and robustness of Vision Transformer (ViT). The technique leverages dynamic input sequences to adaptively focus on different patches, resulting in significant changes in ViT's attention mechanism. Experimental results demonstrate that Dynamic Scanning Augmentation outperforms ViT in terms of both robustness to adversarial attacks and accuracy against natural images.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Introducing shape priors in Siamese networks for image classification

Hiba Alqasir, Damien Muselet, Christophe Ducottet

Summary: The article proposes a solution to improve the learning process of a classification network by providing shape priors, reducing the need for annotated data. The solution is tested on cross-domain digit classification tasks and a video surveillance application.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Neural dynamics solver for time-dependent infinity-norm optimization based on ACP framework with robot application

Dexiu Ma, Mei Liu, Mingsheng Shang

Summary: This paper proposes a method using neural dynamics solvers to solve infinity-norm optimization problems. Two improved solvers are constructed and their effectiveness and superiority are demonstrated through theoretical analysis and simulation experiments.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

cpp-AIF: A multi-core C plus plus implementation of Active Inference for Partially Observable Markov Decision Processes

Francesco Gregoretti, Giovanni Pezzulo, Domenico Maisto

Summary: Active Inference is a computational framework that uses probabilistic inference and variational free energy minimization to describe perception, planning, and action. cpp-AIF is a header-only C++ library that provides a powerful tool for implementing Active Inference for Partially Observable Markov Decision Processes through multi-core computing. It is cross-platform and improves performance, memory management, and usability compared to existing software.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Predicting stock market trends with self-supervised learning

Zelin Ying, Dawei Cheng, Cen Chen, Xiang Li, Peng Zhu, Yifeng Luo, Yuqi Liang

Summary: This paper proposes a novel stock market trends prediction framework called SMART, which includes a self-supervised stock technical data sequence embedding model S3E. By training with multiple self-supervised auxiliary tasks, the model encodes stock technical data sequences into embeddings and uses the learned sequence embeddings for predicting stock market trends. Extensive experiments on China A-Shares market and NASDAQ market prove the high effectiveness of our model in stock market trends prediction, and its effectiveness is further validated in real-world applications in a leading financial service provider in China.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

DHGAT: Hyperbolic representation learning on dynamic graphs via attention networks

Hao Li, Hao Jiang, Dongsheng Ye, Qiang Wang, Liang Du, Yuanyuan Zeng, Liu Yuan, Yingxue Wang, C. Chen

Summary: DHGAT1, a dynamic hyperbolic graph attention network, utilizes hyperbolic metric properties to embed dynamic graphs. It employs a spatiotemporal self-attention mechanism and weighted node representations, resulting in excellent performance in link prediction tasks.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Progressive network based on detail scaling and texture extraction: A more general framework for image deraining

Jiehui Huang, Zhenchao Tang, Xuedong He, Jun Zhou, Defeng Zhou, Calvin Yu-Chian Chen

Summary: This study proposes a progressive learning multi-scale feature blending model for image deraining tasks. The model utilizes detail dilation and texture extraction to improve the restoration of rainy images. Experimental results show that the model achieves near state-of-the-art performance in rain removal tasks and exhibits better rain removal realism.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Stabilization and synchronization control for discrete-time complex networks via the auxiliary role of edges subsystem

Lizhi Liu, Zilin Gao, Yinhe Wang, Yongfu Li

Summary: This paper proposes a novel discrete-time interconnected model for depicting complex dynamical networks. The model consists of nodes and edges subsystems, which consider the dynamic characteristic of both nodes and edges. By designing control strategies and coupling modes, the stabilization and synchronization of the network are achieved. Simulation results demonstrate the effectiveness of the proposed methods.

NEUROCOMPUTING (2024)