☆ 4.6 Article

ADCM: attention dropout convolutional module

NEUROCOMPUTING (2020)

期刊

NEUROCOMPUTING

卷 394, 期 -, 页码 95-104

出版社

ELSEVIER

DOI: 10.1016/j.neucom.2020.02.007

关键词

Attention; Dropout-channel; Dropout-region; Network architecture; Convolutional module

类别

Computer Science, Artificial Intelligence

资金

National Natural Science Foundation of China [61502094, 51774090, 51104030]
Heilongjiang Province Natural Science Foundation of China [F2016002, LH2019F042]
Youth Science Foundation of Northeast Petroleum University [2017PYZL-06, 2018YDL-22, KYCXTD201903]
Daqing Science and Technology Project [ZD-2019-08]

向作者/读者索取更多资源

Protocol

Reagent

摘要

Network architecture design plays an important role in boosting the performance of models in various applications. In this work, we design a general and lightweight module named the attention dropout convolutional module (ADCM). It consists of two submodules, channel attention dropout (CAD) and position attention dropout (PAD), and each submodule integrates both attention and dropout mechanisms. The attention mechanism emphasizes the meaningful information and suppresses unnecessary noise. The dropout-channel in the CAD submodule filters the channel based on its channel attention, while the dropout-region in the PAD submodule filters the region consisting of the spatially correlated features according to its position attention. The two dropout methods we designed allow the baseline network to learn more robust features and further boost its performance. Finally, we deploy the ADCM in consecutive layers of classical convolutional neural networks and evaluate its performance on multiple benchmark datasets. The experimental results demonstrate that the ADCM brings significant improvements to the performance of the baseline models at negligible computational cost and with less complexity. (C) 2020 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者

点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6

评分不足

次要评分

新颖性

-

重要性

-

科学严谨性

-

评价这篇论文

推荐

Article Computer Science, Information Systems

SCMA: Exploring Dual-Module Attention With Multi-Scale Kernels for Effective Feature Extraction

Shaikh Abdus Samad, J. Gitanjali

Summary: Feature space enrichment is crucial for the development of attention mechanisms in CNNs. The research presents SCMA, an attention mechanism that combines channel and spatial attention to extract features efficiently while balancing parameter efficiency and accuracy.

IEEE ACCESS (2023)

添加到收藏夹

Article Engineering, Geological

LandslideCL: towards robust landslide analysis guided by contrastive learning

Penglei Li, Yi Wang, Guosen Xu, Lizhe Wang

Summary: This study presents a novel robust rainfall-induced landslide detection model guided by contrastive learning. The model combines deep learning techniques, such as residual blocks and channel attention modules, to accurately predict landslide locations. It also utilizes contrastive dice similarity coefficient loss to maintain consistency in landslide regions. Experimental results demonstrate that the proposed model performs excellently, outperforming other classic segmentation methods in crucial criteria.

LANDSLIDES (2023)

添加到收藏夹

Article Engineering, Electrical & Electronic

AVNC: Attention-Based VGG-Style Network for COVID-19 Diagnosis by CBAM

Shui-Hua Wang, Steven Lawrence Fernandes, Ziquan Zhu, Yu-Dong Zhang

Summary: To detect COVID-19 patients more accurately, a 12-layer attention-based VGG-style network called AVNC was proposed, using a chest CT dataset and incorporating attention module and data augmentation method, achieving high sensitivity, precision, and F1 scores.

IEEE SENSORS JOURNAL (2022)

添加到收藏夹

Article Computer Science, Information Systems

Graph convolutional network with triplet attention learning for person re-identification

Shimaa Saber, Khalid Amin, Pawel Plawiak, Ryszard Tadeusiewicz, Mohamed Hammad

Summary: Person re-identification is a method that uses multiple non-overlapping cameras for identification, and it has been successfully applied in computer vision applications. To address issues such as occlusion, illumination changes, and pose changes, a new graph convolutional network with attention modules is proposed. Experimental results demonstrate the high generalization ability and superior performance of the proposed method.

INFORMATION SCIENCES (2022)

添加到收藏夹

Article Thermodynamics

WSFNet: An efficient wind speed forecasting model using channel attention-based densely connected convolutional neural network

Hakan Acikgoz, Umit Budak, Deniz Korkmaz, Ceyhun Yildiz

Summary: This paper introduces a novel deep neural network (WSFNet) for efficiently forecasting multi-step ahead wind speed, incorporating dense connections and channel attention modules, as well as utilizing variational mode decomposition for preprocessing, achieving competitive performance.

ENERGY (2021)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Learning efficient, explainable and discriminative representations for pulmonary nodules classification

Hanliang Jiang, Fuhao Shen, Fei Gao, Weidong Han

Summary: This study aims to build an efficient and (partially) explainable automatic classification model for pulmonary nodules. By using neural architecture search and convolutional block attention module, excellent accuracy/speed trade-off is achieved and helps to understand the reasoning process. Ensemble of diverse neural networks is utilized to improve prediction accuracy and robustness.

PATTERN RECOGNITION (2021)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Orthogonal channel attention-based multi-task learning for multi-view facial expression recognition

Jingying Chen, Lei Yang, Lei Tan, Ruyi Xu

Summary: This paper proposes a novel orthogonal channel attention-based multi-task learning approach for multi-view facial expression recognition. By utilizing a Siamese CNN and a multi-task learning framework, as well as designing a separated channel attention module and an orthogonal channel attention loss, this approach achieves good recognition accuracy on two datasets.

PATTERN RECOGNITION (2022)

添加到收藏夹

Article Engineering, Electrical & Electronic

Single-channel blind separation of co-frequency signals based on convolutional network

Hou Xiaoqi, Gao Yong

Summary: This paper proposes a new waveform separation-demodulation scheme for single-channel blind separation, achieving separation and demodulation through a convolutional time-domain network and low-complexity per-survivor processing method, surpassing other network structures in performance evaluation.

DIGITAL SIGNAL PROCESSING (2022)

添加到收藏夹

Article Environmental Sciences

Multi-Pooling Context Network for Image Semantic Segmentation

Qing Liu, Yongsheng Dong, Zhiqiang Jiang, Yuanhua Pei, Boshi Zheng, Lintao Zheng, Zhumu Fu

Summary: With the development of image segmentation technology, the importance of image context information in semantic segmentation has been recognized. In order to capture rich context information effectively, we proposed a Multi-Pooling Context Network (MPCNet) for image semantic segmentation. The network includes Pooling Context Aggregation Module and Spatial Context Module to capture deep context information and detailed spatial context respectively. Experimental results on multiple datasets demonstrate the effectiveness of our proposed network in context extraction.

REMOTE SENSING (2023)

添加到收藏夹

Article Engineering, Electrical & Electronic

Channel-Wise Correlation Calibrates Attention Module for Convolutional Neural Networks

Ziqiang Lu, Yanwu Dong, Jie Li, Ziying Lu, Pengjie He, Haibo Ru

Summary: This study introduces a new channel attention module LCM, which optimizes the correlation between channel features by integrating global information and channel dependence, showing superiority in experiments.

JOURNAL OF SENSORS (2022)

添加到收藏夹

Article Computer Science, Information Systems

CCNet: CNN model with channel attention and convolutional pooling mechanism for spatial image steganalysis

Tong Fu, Liquan Chen, Zhangjie Fu, Kunliang Yu, Yu Wang

Summary: This paper introduces a new approach for image steganalysis based on convolutional neural networks that focuses on complex regional texture features and improves detection accuracy. Experimental results demonstrate that the proposed model outperforms existing models in terms of detection accuracy.

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION (2022)

添加到收藏夹

Article Computer Science, Artificial Intelligence

AESPNet: Attention Enhanced Stacked Parallel Network to improve automatic Diabetic Foot Ulcer identification

Sujit Kumar Das, Suyel Namasudra, Awnish Kumar, Nageswara Rao Moparthi

Summary: This paper presents an efficient approach based on Convolutional Neural Network (CNN) called AESPNet for the identification of Diabetic Foot Ulcer (DFU). Compared with other standard CNN-based schemes, AESPNet demonstrates better performance in DFU classification.

IMAGE AND VISION COMPUTING (2023)

添加到收藏夹

Article Multidisciplinary Sciences

Sound source localization based on residual network and channel attention module

Fucai Hu, Xiaohui Song, Ruhan He, Yongsheng Yu

Summary: This paper proposes a sound source localization (SSL) model based on residual network and channel attention mechanism. The method uses log-Mel spectrogram and GCC-PHAT as input features, and extracts time-frequency information using the residual structure and channel attention mechanism, resulting in improved localizing performance.

SCIENTIFIC REPORTS (2023)

添加到收藏夹

Article Automation & Control Systems

BOLD-net: Brightness enhancement for old images using deep curve estimation and attention modules

Arshiana Shamir, Nokap Park, Bumshik Lee

Summary: A novel deep-learning network is proposed for brightness enhancement of old images, which combines curve map estimation and attention-guided illumination map to adjust the dynamic range and illumination of the images. Experimental results show that the proposed method outperforms existing methods in brightness enhancement on old photo and video datasets.

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2023)

添加到收藏夹

Article Environmental Sciences

A Distributed Fusion Framework of Multispectral and Panchromatic Images Based on Residual Network

Yuanyuan Wu, Mengxing Huang, Yuchun Li, Siling Feng, Di Wu

Summary: This study introduces a pan-sharpening method combining remote sensing images with CNN, proposing a distributed fusion framework based on residual CNN, RDFNet, to improve image resolution and preserve spectral information. Experimental results show that RDFNet performs superiorly in enhancing spatial resolution and fusion quality.

REMOTE SENSING (2021)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Small traffic sign detection from large image

Zhigang Liu, Dongyu Li, Shuzhi Sam Ge, Feng Tian

APPLIED INTELLIGENCE (2020)

添加到收藏夹

Article Engineering, Electrical & Electronic

Traffic Sign Recognition Using an Attentive Context Region-Based Detection Framework

Liu Zhigang, Du Juan, Tian Feng, Wen Jiazheng

Summary: Accurate recognition of small traffic signs is crucial for the safety of intelligent transportation systems. A novel recognition framework named attentive context region-based detection framework (AC-RDF) is proposed in this paper, which utilizes attentive context feature and attentive loss function to improve recognition accuracy. Experimental results on the Tsinghua-Tencent 100K dataset demonstrate the superiority of the proposed framework in detecting small traffic signs and achieving state-of-the-art performance.

CHINESE JOURNAL OF ELECTRONICS (2021)

添加到收藏夹

Article Computer Science, Information Systems

MR-CNN: A Multi-Scale Region-Based Convolutional Neural Network for Small Traffic Sign Recognition

Zhigang Liu, Juan Du, Feng Tian, Jiazheng Wen

IEEE ACCESS (2019)

添加到收藏夹

Article Computer Science, Artificial Intelligence

3D-KCPNet: Efficient 3DCNNs based on tensor mapping theory

Rui Lv, Dingheng Wang, Jiangbin Zheng, Zhao-Xu Yang

Summary: In this paper, the authors investigate tensor decomposition for neural network compression. They analyze the convergence and precision of tensor mapping theory, validate the rationality of tensor mapping and its superiority over traditional tensor approximation based on the Lottery Ticket Hypothesis. They propose an efficient method called 3D-KCPNet to compress 3D convolutional neural networks using the Kronecker canonical polyadic (KCP) tensor decomposition. Experimental results show that 3D-KCPNet achieves higher accuracy compared to the original baseline model and the corresponding tensor approximation model.

NEUROCOMPUTING (2024)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Personalized robotic control via constrained multi-objective reinforcement learning

Xiangkun He, Zhongxu Hu, Haohan Yang, Chen Lv

Summary: In this paper, a novel constrained multi-objective reinforcement learning algorithm is proposed for personalized end-to-end robotic control with continuous actions. The approach trains a single model using constraint design and a comprehensive index to achieve optimal policies based on user-specified preferences.

NEUROCOMPUTING (2024)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Overlapping community detection using expansion with contraction

Zhijian Zhuo, Bilian Chen, Shenbao Yu, Langcai Cao

Summary: In this paper, a novel method called Expansion with Contraction Method for Overlapping Community Detection (ECOCD) is proposed, which utilizes non-negative matrix factorization to obtain disjoint communities and applies expansion and contraction processes to adjust the degree of overlap. ECOCD is applicable to various networks with different properties and achieves high-quality overlapping community detection.

NEUROCOMPUTING (2024)

添加到收藏夹

Article Computer Science, Artificial Intelligence

High-compressed deepfake video detection with contrastive spatiotemporal distillation

Yizhe Zhu, Chunhui Zhang, Jialin Gao, Xin Sun, Zihan Rui, Xi Zhou

Summary: In this work, the authors propose a Contrastive Spatio-Temporal Distilling (CSTD) approach to improve the detection of high-compressed deepfake videos. The approach leverages spatial-frequency cues and temporal-contrastive alignment to fully exploit spatiotemporal inconsistency information.

NEUROCOMPUTING (2024)

添加到收藏夹

Review Computer Science, Artificial Intelligence

A review of coverless steganography

Laijin Meng, Xinghao Jiang, Tanfeng Sun

Summary: This paper provides a review of coverless steganographic algorithms, including the development process, known contributions, and general issues in image and video algorithms. It also discusses the security of coverless steganography from theoretical analysis to actual investigation for the first time.

NEUROCOMPUTING (2024)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Confidence-based interactable neural-symbolic visual question answering

Yajie Bao, Tianwei Xing, Xun Chen

Summary: Visual question answering requires processing multi-modal information and effective reasoning. Neural-symbolic learning is a promising method, but current approaches lack uncertainty handling and can only provide a single answer. To address this, we propose a confidence based neural-symbolic approach that evaluates NN inferences and conducts reasoning based on confidence.

NEUROCOMPUTING (2024)

添加到收藏夹

Article Computer Science, Artificial Intelligence

A framework-based transformer and knowledge distillation for interior style classification

Anh H. Vo, Bao T. Nguyen

Summary: Interior style classification is an interesting problem with potential applications in both commercial and academic domains. This project proposes a method named ISC-DeIT, which combines data-efficient image transformer architectures and knowledge distillation, to address the interior style classification problem. Experimental results demonstrate a significant improvement in predictive accuracy compared to other state-of-the-art methods.

NEUROCOMPUTING (2024)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Improving robustness for vision transformer with a simple dynamic scanning augmentation

Shashank Kotyan, Danilo Vasconcellos Vargas

Summary: This article introduces a novel augmentation technique called Dynamic Scanning Augmentation to improve the accuracy and robustness of Vision Transformer (ViT). The technique leverages dynamic input sequences to adaptively focus on different patches, resulting in significant changes in ViT's attention mechanism. Experimental results demonstrate that Dynamic Scanning Augmentation outperforms ViT in terms of both robustness to adversarial attacks and accuracy against natural images.

NEUROCOMPUTING (2024)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Introducing shape priors in Siamese networks for image classification

Hiba Alqasir, Damien Muselet, Christophe Ducottet

Summary: The article proposes a solution to improve the learning process of a classification network by providing shape priors, reducing the need for annotated data. The solution is tested on cross-domain digit classification tasks and a video surveillance application.

NEUROCOMPUTING (2024)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Neural dynamics solver for time-dependent infinity-norm optimization based on ACP framework with robot application

Dexiu Ma, Mei Liu, Mingsheng Shang

Summary: This paper proposes a method using neural dynamics solvers to solve infinity-norm optimization problems. Two improved solvers are constructed and their effectiveness and superiority are demonstrated through theoretical analysis and simulation experiments.

NEUROCOMPUTING (2024)

添加到收藏夹

Article Computer Science, Artificial Intelligence

cpp-AIF: A multi-core C plus plus implementation of Active Inference for Partially Observable Markov Decision Processes

Francesco Gregoretti, Giovanni Pezzulo, Domenico Maisto

Summary: Active Inference is a computational framework that uses probabilistic inference and variational free energy minimization to describe perception, planning, and action. cpp-AIF is a header-only C++ library that provides a powerful tool for implementing Active Inference for Partially Observable Markov Decision Processes through multi-core computing. It is cross-platform and improves performance, memory management, and usability compared to existing software.

NEUROCOMPUTING (2024)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Predicting stock market trends with self-supervised learning

Zelin Ying, Dawei Cheng, Cen Chen, Xiang Li, Peng Zhu, Yifeng Luo, Yuqi Liang

Summary: This paper proposes a novel stock market trends prediction framework called SMART, which includes a self-supervised stock technical data sequence embedding model S3E. By training with multiple self-supervised auxiliary tasks, the model encodes stock technical data sequences into embeddings and uses the learned sequence embeddings for predicting stock market trends. Extensive experiments on China A-Shares market and NASDAQ market prove the high effectiveness of our model in stock market trends prediction, and its effectiveness is further validated in real-world applications in a leading financial service provider in China.

NEUROCOMPUTING (2024)

添加到收藏夹

Article Computer Science, Artificial Intelligence

DHGAT: Hyperbolic representation learning on dynamic graphs via attention networks

Hao Li, Hao Jiang, Dongsheng Ye, Qiang Wang, Liang Du, Yuanyuan Zeng, Liu Yuan, Yingxue Wang, C. Chen

Summary: DHGAT1, a dynamic hyperbolic graph attention network, utilizes hyperbolic metric properties to embed dynamic graphs. It employs a spatiotemporal self-attention mechanism and weighted node representations, resulting in excellent performance in link prediction tasks.

NEUROCOMPUTING (2024)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Progressive network based on detail scaling and texture extraction: A more general framework for image deraining

Jiehui Huang, Zhenchao Tang, Xuedong He, Jun Zhou, Defeng Zhou, Calvin Yu-Chian Chen

Summary: This study proposes a progressive learning multi-scale feature blending model for image deraining tasks. The model utilizes detail dilation and texture extraction to improve the restoration of rainy images. Experimental results show that the model achieves near state-of-the-art performance in rain removal tasks and exhibits better rain removal realism.

NEUROCOMPUTING (2024)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Stabilization and synchronization control for discrete-time complex networks via the auxiliary role of edges subsystem

Lizhi Liu, Zilin Gao, Yinhe Wang, Yongfu Li

Summary: This paper proposes a novel discrete-time interconnected model for depicting complex dynamical networks. The model consists of nodes and edges subsystems, which consider the dynamic characteristic of both nodes and edges. By designing control strategies and coupling modes, the stabilization and synchronization of the network are achieved. Simulation results demonstrate the effectiveness of the proposed methods.

NEUROCOMPUTING (2024)

添加到收藏夹

© Peeref 2019-2024. All rights reserved.