4.7 Article

Multi-scale convolutional neural network for multi-focus image fusion

期刊

IMAGE AND VISION COMPUTING
卷 85, 期 -, 页码 26-35

出版社

ELSEVIER
DOI: 10.1016/j.imavis.2019.03.001

关键词

Multi-focus image fusion; Convolutional neural network; Unsupervised; Structure similarity

资金

  1. NSFC, China [61876107, 61572315, U1803261]
  2. 973 Plan, China [2015CB856004]

向作者/读者索取更多资源

In this study, we present new deep learning (DL) method for fusing multi-focus images. Current multi-focus image fusion (MFIF) approaches based on DL methods mainly treat MFIF as a classification task. These methods use a convolutional neural network (CNN) as a classifier to identify pixels as focused or defocused pixels. However, due to unavailability of labeled data to train networks, existing DL-based supervised models for MFIF add Gaussian blur in focused images to produce training data. DL-based unsupervised models are also too simple and only applicable to perform fusion tasks other than MFIF. To address the above issues, we proposed a new MFIF method, which aims to learn feature extraction, fusion and reconstruction components together to produce a complete unsupervised end-to-end trainable deep CNN. To enhance the feature extraction capability of CNN, we introduce a Siamese multi-scale feature extraction module to achieve a promising performance. In our proposed network we applied multiscale convolutions along with skip connections to extract more useful common features from a multi-focus image pair. Instead of using basic loss functions to train the CNN, our model utilizes structure similarity (SSIM) measure as a training loss function. Moreover, the fused images are reconstructed in a multiscale manner to guarantee more accurate restoration of images. Our proposed model can process images with variable size during testing and validation. Experimental results on various test images validate that our proposed method yields better quality fused images that are superior to the fused images generated by compared state-of-the-art image fusion methods. (C) 2019 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Information Systems

AMIL: Adversarial Multi-instance Learning for Human Pose Estimation

Pourya Shamsolmoali, Masoumeh Zareapoor, Huiyu Zhou, Jie Yang

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS (2020)

Article Computer Science, Artificial Intelligence

GAN-Poser: an improvised bidirectional GAN model for human motion prediction

Deepak Kumar Jain, Masoumeh Zareapoor, Rachna Jain, Abhishek Kathuria, Shivam Bachhety

NEURAL COMPUTING & APPLICATIONS (2020)

Article Computer Science, Artificial Intelligence

Imbalanced data learning by minority class augmentation using capsule adversarial networks

Pourya Shamsolmoali, Masoumeh Zareapoor, Linlin Shen, Abdul Hamid Sadka, Jie Yang

Summary: This paper proposes a method to address imbalanced image datasets by introducing a competitive game between generator and discriminator networks, which improves learning from imbalanced data. The generator is trained with a feature matching loss function to prevent the generation of outliers and maintain the majority class space.

NEUROCOMPUTING (2021)

Article Computer Science, Artificial Intelligence

Multimodal image fusion based on point-wise mutual information

Donghao Shen, Masoumeh Zareapoor, Jie Yang

Summary: The paper introduces a novel multimodal image fusion algorithm focusing on transferring salient structures and maintaining spatial consistency. The algorithm selects features to transfer using a graph cut algorithm, with spatial varying smoothness cost formulated based on the independence between local features.

IMAGE AND VISION COMPUTING (2021)

Article Engineering, Mechanical

Oversampling adversarial network for class-imbalanced fault diagnosis

Masoumeh Zareapoor, Pourya Shamsolmoali, Jie Yang

Summary: This paper introduces a new adversarial network model for simultaneous classification and fault detection. By generating faulty samples from a mixture of data distribution to restore balance in imbalanced datasets, the proposed model performs well in experiments, particularly in recognizing faulty samples.

MECHANICAL SYSTEMS AND SIGNAL PROCESSING (2021)

Article Computer Science, Artificial Intelligence

Cluster-wise unsupervised hashing for cross-modal similarity search

Lu Wang, Jie Yang, Masoumeh Zareapoor, Zhonglong Zheng

Summary: This paper introduces a novel framework for projecting original data points from different modalities into low-dimensional latent space and finding cluster centroid points using Cluster-wise Unsupervised Hashing (CUH). The framework aims to jointly learn compact hash codes and corresponding linear hash functions, showing superior effectiveness in unsupervised cross-modal hashing tasks compared to state-of-the-art methods.

PATTERN RECOGNITION (2021)

Article Computer Science, Information Systems

Equivariant Adversarial Network for Image-to-image Translation

Masoumeh Zareapoor, Jie Yang

Summary: Image-to-Image translation faces challenges such as lack of paired datasets, multimodality, and diversity. A new variation of generative models using a trainable transformer aims to address these issues by explicitly allowing spatial manipulation of data within training.

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS (2021)

Article Geochemistry & Geophysics

Road Segmentation for Remote Sensing Images Using Adversarial Spatial Pyramid Networks

Pourya Shamsolmoali, Masoumeh Zareapoor, Huiyu Zhou, Ruili Wang, Jie Yang

Summary: A new model is introduced to apply structured domain adaption for synthetic image generation and road segmentation, incorporating a feature pyramid network into generative adversarial networks to minimize the difference between the source and target domains and improve road extraction accuracy and completeness.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2021)

Article Computer Science, Artificial Intelligence

Image synthesis with adversarial networks: A comprehensive survey and case studies

Pourya Shamsolmoali, Masoumeh Zareapoor, Eric Granger, Huiyu Zhou, Ruili Wang, M. Emre Celebi, Jie Yang

Summary: Generative Adversarial Networks (GANs) have shown great success in various fields, particularly in image synthesis. This survey provides a comprehensive review of adversarial models for image synthesis, summarizing methods and discussing future research directions. Additionally, all software implementations and datasets of these GAN methods have been collected and made available, which is a unique feature of this review.

INFORMATION FUSION (2021)

Article Geochemistry & Geophysics

Rotation Equivariant Feature Image Pyramid Network for Object Detection in Optical Remote Sensing Imagery

Pourya Shamsolmoali, Masoumeh Zareapoor, Jocelyn Chanussot, Huiyu Zhou, Jie Yang

Summary: The study introduces a novel image pyramid network based on rotation equivariance convolution to tackle the challenge of extracting features for small-scale objects in current object detectors. The proposed model combines a single-shot detector with a lightweight image pyramid module, allowing for feature extraction across various scales and orientations in an optimized manner.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2022)

Article Geochemistry & Geophysics

Multipatch Feature Pyramid Network for Weakly Supervised Object Detection in Optical Remote Sensing Images

Pourya Shamsolmoali, Jocelyn Chanussot, Masoumeh Zareapoor, Huiyu Zhou, Jie Yang

Summary: In this study, a new architecture called MPFP-Net is proposed to address the challenges of object detection in remote sensing images. By dividing patches into class-affiliated subsets and designing a sequence of smooth loss functions, the model is improved to better collect small object parts. The network utilizes bottom-up and crosswise connections to fuse features of different scales for enhanced accuracy, while also being more efficient than baseline models.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2022)

Article Automation & Control Systems

GEN: Generative Equivariant Networks for Diverse Image-to-Image Translation

Pourya Shamsolmoali, Masoumeh Zareapoor, Swagatam Das, Salvador Garcia, Eric Granger, Jie Yang

Summary: Image-to-image translation is crucial in generative adversarial networks. Convolutional neural networks have limitations in capturing spatial relationships, making them unsuitable for image translation tasks. Capsule networks are proposed as a remedy, capturing hierarchical spatial relationships. In this paper, a new framework for capsule networks is presented, which can be applied to generator-discriminator architectures without computational overhead. A Gromov-Wasserstein distance is used as a loss function to guide the learned distribution. The proposed method, called generative equivariant network, is evaluated on I2I translation and image generation tasks and shows a principled connection between generative and capsule models.

IEEE TRANSACTIONS ON CYBERNETICS (2023)

Article Computer Science, Artificial Intelligence

VTAE: Variational Transformer Autoencoder With Manifolds Learning

Pourya Shamsolmoali, Masoumeh Zareapoor, Huiyu Zhou, Dacheng Tao, Xuelong Li

Summary: Deep generative models can learn non-linear data distributions using latent variables and a non-linear generator function. However, the weak projection of the latent space into the data space can result in poor representation learning. This paper proposes a Variational spatial-Transformer AutoEncoder (VTAE) that minimizes geodesics on a Riemannian manifold to improve representation learning.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2023)

Proceedings Paper Computer Science, Artificial Intelligence

Salient Skin Lesion Segmentation via Dilated Scale-Wise Feature Fusion Network

Pourya Shamsolmoali, Masoumeh Zareapoor, Jie Yang, Eric Granger, Huiyu Zhou

Summary: This study proposes a dilated scale-wise feature fusion network based on convolution factorization for skin lesion detection in dermoscopic images. The proposed model can extract features at different scales and fuse them for better lesion detection.

2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) (2022)

Article Computer Science, Information Systems

Asymmetric Correlation Quantization Hashing for Cross-Modal Retrieval

Lu Wang, Masoumeh Zareapoor, Jie Yang, Zhonglong Zheng

Summary: Cross-modal hashing (CMH) is a method that can learn and retrieve similarity across different modalities. However, existing methods have limitations in fully exploiting the underlying properties of multi-modal data and often suffer from significant quantization errors. This paper proposes a novel Asymmetric Correlation Quantization Hashing (ACQH) method to address these challenges, which learns projection matrices and constructs hash codes using semantic similarity preservation and label regression.

IEEE TRANSACTIONS ON MULTIMEDIA (2022)

暂无数据