Article

SDNet: A Versatile Squeeze-and-Decomposition Network for Real-Time Image Fusion

Journal

INTERNATIONAL JOURNAL OF COMPUTER VISION
Volume 129, Issue 10, Pages 2761-2785

Publisher

SPRINGER
DOI: 10.1007/s11263-021-01501-8

Keywords

Image fusion; Real time; Adaptive; Proportion; Squeeze decomposition

This paper introduces a squeeze-and-decomposition network (SDNet) for real-time multi-modal and digital photography image fusion. By transforming fusion problems into extraction and reconstruction of gradient and intensity information, and introducing the squeeze and decomposition concept into image fusion, the method outperforms state-of-the-art techniques in subjective visual effect and quantitative metrics in various fusion tasks, while also being much faster for real-time fusion tasks.
In this paper, a squeeze-and-decomposition network (SDNet) is proposed to realize multi-modal and digital photography image fusion in real time. First, we cast multiple fusion problems as the extraction and reconstruction of gradient and intensity information, and accordingly design a universal loss function composed of an intensity term and a gradient term. For the gradient term, we introduce an adaptive decision block that chooses the optimization target of the gradient distribution according to texture richness at the pixel scale, guiding the fused image to contain richer texture details. For the intensity term, we adjust the weight of each intensity loss term to change the proportion of intensity information drawn from the different source images, so that the loss can be adapted to multiple image fusion tasks. Second, we introduce the idea of squeeze and decomposition into image fusion. Specifically, we consider not only the squeeze process from the source images to the fused result, but also the decomposition process from the fused result back to the source images. Because the quality of the decomposed images depends directly on the fused result, this forces the fused result to contain more scene details. Experimental results demonstrate the superiority of our method over state-of-the-art methods in terms of subjective visual effect and quantitative metrics in a variety of fusion tasks. Moreover, our method is much faster than state-of-the-art methods and can handle real-time fusion tasks.
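The loss structure described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the Laplacian gradient operator, the squared-error form of each term, the hard per-pixel mask, and all function and parameter names (laplacian, decision_mask, w_a, w_b, lam) are assumptions made for illustration only.

```python
import numpy as np

def laplacian(img):
    """4-neighbour Laplacian with edge replication (an assumed gradient operator)."""
    img = np.asarray(img, dtype=np.float64)
    p = np.pad(img, 1, mode="edge")
    return p[:-2, 1:-1] + p[2:, 1:-1] + p[1:-1, :-2] + p[1:-1, 2:] - 4.0 * img

def decision_mask(src_a, src_b):
    """Per-pixel decision: 1.0 where src_a has the stronger texture response, else 0.0."""
    return (np.abs(laplacian(src_a)) >= np.abs(laplacian(src_b))).astype(np.float64)

def gradient_loss(fused, src_a, src_b):
    """Pull the fused gradient toward whichever source is texture-richer at each pixel."""
    mask = decision_mask(src_a, src_b)
    gf, ga, gb = laplacian(fused), laplacian(src_a), laplacian(src_b)
    return float(np.mean(mask * (gf - ga) ** 2 + (1.0 - mask) * (gf - gb) ** 2))

def intensity_loss(fused, src_a, src_b, w_a=0.5, w_b=0.5):
    """Weighted intensity fidelity; w_a and w_b set the proportion taken from each source."""
    fused = np.asarray(fused, dtype=np.float64)
    return float(w_a * np.mean((fused - src_a) ** 2) + w_b * np.mean((fused - src_b) ** 2))

def decomposition_loss(dec_a, dec_b, src_a, src_b):
    """Consistency between images decomposed from the fused result and the sources."""
    return float(np.mean((dec_a - src_a) ** 2) + np.mean((dec_b - src_b) ** 2))

def total_loss(fused, dec_a, dec_b, src_a, src_b, w_a=0.5, w_b=0.5, lam=1.0):
    """Squeeze loss (gradient + intensity) plus a decomposition term weighted by lam."""
    squeeze = gradient_loss(fused, src_a, src_b) + intensity_loss(fused, src_a, src_b, w_a, w_b)
    return squeeze + lam * decomposition_loss(dec_a, dec_b, src_a, src_b)
```

In this sketch, changing w_a and w_b shifts how much intensity the fused image inherits from each source, which is the mechanism the abstract credits for adapting one loss to several fusion tasks. In a trained network, fused, dec_a, and dec_b would be network outputs; here they are plain arrays.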

Recommended

Article Automation & Control Systems

Loop Closure Detection via Locality Preserving Matching With Global Consensus

Jiayi Ma, Kaining Zhang, Junjun Jiang

Summary: A novel appearance-based loop closure detection system is proposed in this work, which selects candidate frames using Super-features and ASMK, and verifies loops using LPM-GC algorithm. Experimental results demonstrate that the proposed method achieves good performance in loop closure detection task.

IEEE-CAA JOURNAL OF AUTOMATICA SINICA (2023)

Article Automation & Control Systems

Sparse Tensor Prior for Hyperspectral, Multispectral, and Panchromatic Image Fusion

Xin Tian, Wei Zhang, Dian Yu, Jiayi Ma

Summary: This letter proposes a new hyperspectral fusion paradigm that simultaneously fuses hyperspectral, multispectral, and panchromatic images. It introduces a novel sparse tensor prior using patch-based sparse tensor dictionary learning to better describe the inherent structures of high-resolution hyperspectral images. The effectiveness of the proposed method is validated through numerous experiments in terms of visual quality and quantitative analysis.

IEEE-CAA JOURNAL OF AUTOMATICA SINICA (2023)

Article Computer Science, Artificial Intelligence

JRA-Net: Joint representation attention network for correspondence learning

Ziwei Shi, Guobao Xiao, Linxin Zheng, Jiayi Ma, Riqing Chen

Summary: In this paper, a novel Joint Representation Attention Network (JRA-Net) is proposed to establish reliable correspondences for image pairs. The attention mechanism and weight function are used to improve the reliability of correspondences and enhance the generalization ability. Empirical experiments demonstrate the effectiveness and superiority of JRA-Net.

PATTERN RECOGNITION (2023)

Article Computer Science, Artificial Intelligence

Hierarchical image peeling: A flexible scale-space filtering framework

Yuanbin Fu, Jiayi Ma, Xiaojie Guo

Summary: This paper presents a hierarchical image organization framework based on scale-space perspective. It converts the original complex problem into a series of two-component separation sub-problems, and provides theoretical and experimental results to demonstrate its effectiveness and superiority.

COMPUTER VISION AND IMAGE UNDERSTANDING (2023)

Article Computer Science, Artificial Intelligence

Improving sparse graph attention for feature matching by informative keypoints exploration

Xingyu Jiang, Shihua Zhang, Xiao-Ping Zhang, Jiayi Ma

Summary: This paper proposes an enhanced sparse GNN method using Guided Attentional Pooling to emphasize informative cues in GNN layers. It extracts information-rich keypoints and uses them to guide attentional pooling for better information preservation. Experimental results show that the method outperforms existing techniques in tasks such as camera pose estimation, fundamental matrix estimation, and visual localization.

COMPUTER VISION AND IMAGE UNDERSTANDING (2023)

Article Automation & Control Systems

Neighborhood Manifold Preserving Matching for Visual Place Recognition

Xinyu Ye, Jiayi Ma

Summary: This article proposes an effective and efficient visual place recognition (VPR) approach that integrates semantic, sequential, and spatial geometric information. The focus is on candidate selection and geometric verification, rather than feature extraction. The proposed method, neighborhood manifold preserving matching (NMP), utilizes sequence partitioning and sequence-to-sequence matching to improve VPR performance. Experimental results demonstrate the superiority of the proposed VPR method and its potential for integration with other pipelines.

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS (2023)

Article Computer Science, Artificial Intelligence

Hyperspectral image denoising via spectral noise distribution bootstrap

Erting Pan, Yong Ma, Xiaoguang Mei, Fan Fan, Jiayi Ma

Summary: Hyperspectral image denoising is a challenging problem, and prior knowledge about hyperspectral noise is essential for developing an effective denoising method. Most existing methods assume equal noise intensity in all bands, which contradicts practical HSIs and leads to unsatisfactory results. To address this, we propose a novel denoising framework called N̂-Net, which utilizes the intrinsic properties of real HSI noise and employs a bootstrap mechanism for better denoising performance.

PATTERN RECOGNITION (2023)

Article Computer Science, Artificial Intelligence

Variational Bayesian deep network for blind Poisson denoising

Hao Liang, Rui Liu, Zhongyuan Wang, Jiayi Ma, Xin Tian

Summary: Deep learning-based approaches have achieved significant results in Poisson denoising under low-light conditions. However, most existing methods focus on network architecture design without physical interpretability, making them unsuitable for blind denoising in real environments. To address this, the authors propose VBDNet, a variational Bayesian deep network that combines Bayesian inference and deep learning for blind Poisson denoising. VBDNet outperforms state-of-the-art methods on synthetic and natural data.

PATTERN RECOGNITION (2023)

Article Geochemistry & Geophysics

Coarse-to-Fine Cross-Domain Learning Fusion Network for Pansharpening

Chengjie Ke, Wei Zhang, Zhongyuan Wang, Jiayi Ma, Xin Tian

Summary: We propose a coarse-to-fine adaptation learning fusion network for pansharpening, which combines the advantages of UNet and Transformer architectures to explore texture information of different characteristics. The network adjusts the spatial and spectral information of the coarse-fusion image based on target-specific knowledge in an unsupervised learning manner. Experiments show that our proposed method outperforms other state-of-the-art DL methods in terms of visual quality and quantitative analysis.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2023)

Article Computer Science, Artificial Intelligence

Hyperspectral image destriping and denoising from a task decomposition view

Erting Pan, Yong Ma, Xiaoguang Mei, Jun Huang, Qihai Chen, Jiayi Ma

Summary: In this article, a novel approach for denoising and destriping HSI is proposed. By decomposing the task into auxiliary sub-tasks, the shortcomings of the generalized mathematical model are addressed, leading to accurate destriping and high-fidelity restoration.

PATTERN RECOGNITION (2023)

Article Computer Science, Artificial Intelligence

Learning a 3D-CNN and Transformer prior for hyperspectral image super-resolution

Qing Ma, Junjun Jiang, Xianming Liu, Jiayi Ma

Summary: To address the problem of hyperspectral image super-resolution, this paper proposes a novel method that uses the Transformer architecture to learn the prior of hyperspectral images (HSIs). By adding a 3D-CNN behind the Transformer layers, the method aims to capture the spatio-spectral correlation of HSIs. Experimental results demonstrate that the proposed method outperforms existing algorithms in terms of quantitative and visual quality.

INFORMATION FUSION (2023)

Article Geochemistry & Geophysics

Progressive Hyperspectral Image Destriping With an Adaptive Frequencial Focus

Erting Pan, Yong Ma, Xiaoguang Mei, Fan Fan, Jun Huang, Jiayi Ma

Summary: This study proposes a progressive hyperspectral destriping method with an adaptive frequency focus for accurate destriping and delicate restoration. The method encodes the degraded input to the frequency domain with smaller scales and separates noise and preserves details in the high-frequency domain. The experimental results demonstrate the superiority of the proposed method over the current state-of-the-art methods.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2023)

Article Geochemistry & Geophysics

Multipatch Progressive Pansharpening With Knowledge Distillation

Meiqi Gong, Hao Zhang, Han Xu, Xin Tian, Jiayi Ma

Summary: In this article, a novel multipatch and multistage pansharpening method called PSDNet is proposed, which utilizes knowledge distillation. The method incorporates multipatch inputs and a multistage network for more accurate learning. Small patches are used in the early part to learn accurate local information, while large patches are employed later to fine-tune for overall information. The multistage network reduces the difficulty of single-step pansharpening and generates elaborate results progressively. Distillation loss is introduced to reinforce the guidance of the ground truth, leading to superior performance compared to existing state-of-the-art methods. The code for PSDNet is available at https://github.com/Meiqi-Gong/PSDNet.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2023)

Article Computer Science, Artificial Intelligence

Zero-Sharpen: A universal pansharpening method across satellites for reducing scale-variance gap via zero-shot variation

Hebaixu Wang, Hao Zhang, Xin Tian, Jiayi Ma

Summary: Pansharpening is a technique that combines a high-resolution panchromatic image and a low-resolution multi-spectral image to generate a high-resolution multi-spectral image. The proposed Zero-Sharpen method, which combines deep learning and variational optimization, can be easily applied across different satellites and reduces scale variance. Extensive experiments demonstrate the superiority of this method over existing ones.

INFORMATION FUSION (2024)

Article Computer Science, Theory & Methods

Deep Learning-based Face Super-resolution: A Survey

Junjun Jiang, Chenyang Wang, Xianming Liu, Jiayi Ma

Summary: This survey systematically reviews deep learning-based face super-resolution (FSR) methods. It summarizes the problem formulation of FSR and introduces assessment metrics and loss functions. It elaborates on facial characteristics and popular datasets used in FSR, and categorizes existing methods by how they use facial characteristics. For each category, it gives a general description of design principles, an overview of representative approaches, and a discussion of their pros and cons. The survey also evaluates the performance of state-of-the-art methods, introduces joint FSR and other tasks as well as FSR-related applications, and envisions future technological advances in the field.

ACM COMPUTING SURVEYS (2023)
