☆ 4.6 Article

Split-guidance network for salient object detection

VISUAL COMPUTER (2023)

期刊

VISUAL COMPUTER

卷 39, 期 4, 页码 1437-1451

出版社

SPRINGER

DOI: 10.1007/s00371-022-02421-5

关键词

Salient object detection; Split-guidance convolution; Unified decoder

类别

Computer Science, Software Engineering

向作者/读者索取更多资源

Protocol

Reagent

智能总结 New
摘要

A simple yet efficient split-guidance convolution block is proposed to improve the multi-scale representation ability and a unified decoder for both RGB SOD and RGB-D SOD is built.

Due to the large-scale variation in objects in practical scenes, multi-scale representation is of critical importance for salient object detection (SOD). Recent advances in multi-level feature fusion also demonstrate its contribution in consistent performance gains. Different from the existing layer-wise methods, we propose a simple yet efficient split-guidance convolution block to improve the multi-scale representation ability at a granular level in this paper. Specifically, the input feature is first split into different subsets; each of them is guided by all the subsets in front of it, in this way to increase the range of receptive fields for each network layer. By embedding it into each side-output stage of the encoder, we build a unified decoder for both RGB SOD and RGB-D SOD. Experimental results on five RGB datasets, five RGB-D datasets and three RGB-T datasets demonstrate that the proposed method without any attention mechanisms and other complex designs performs favorably against state-of-the-art approaches and also shows advantages in simplicity, efficiency and compactness.

作者

我是这篇论文的作者

点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6

评分不足

次要评分

新颖性

-

重要性

-

科学严谨性

-

评价这篇论文

推荐

Article Computer Science, Artificial Intelligence

Rethinking Lightweight Salient Object Detection via Network Depth-Width Tradeoff

Jia Li, Shengye Qiao, Zhirui Zhao, Chenxi Xie, Xiaowu Chen, Changqun Xia

Summary: This article introduces a lightweight framework for salient object detection, which addresses the dilution of semantic context, loss of spatial structure, and absence of boundary detail by decoupling the U-shape structure into three branches. The proposed Scale-Adaptive Pooling Module is used to obtain multi-scale receptive field. Experimental results demonstrate that the method achieves a better balance between efficiency and accuracy.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2023)

添加到收藏夹

Article Computer Science, Information Systems

Spatial frequency enhanced salient object detection

Xiaofang Li, Yi Wang, Tianzhu Wang, Ruili Wang

Summary: In this work, an effective and flexible spatial frequency enhancement (SFE) module based on generalized Oct-convolution is proposed. It can extract and incorporate multiple spatial frequency information from different feature maps and output comprehensive and compact frequency features. A spatial frequency enhanced network (SFENet) is then designed, which adopts two SFE modules to refine high- and low-frequency salient features and integrates them into a final full-band saliency prediction.

INFORMATION SCIENCES (2023)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Weakly Alignment-Free RGBT Salient Object Detection With Deep Correlation Network

Zhengzheng Tu, Zhun Li, Chenglong Li, Jin Tang

Summary: In this study, we propose a novel deep correlation network for RGBT Salient Object Detection (SOD). The network explores the correlations between RGB and thermal modalities, and incorporates a modality alignment module and a bi-directional decoder model to handle unaligned image pairs and enhance feature representation. Experimental results show that our method outperforms state-of-the-art methods on three benchmark datasets.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2022)

添加到收藏夹

Article Computer Science, Information Systems

Multi-Guidance CNNs for Salient Object Detection

Shuaixiong Hui, Qiang Guo, Xiaoyu Geng, Caiming Zhang

Summary: Feature refinement and fusion are crucial steps in SOD. This article proposes MGuid-Net, a novel multi-guidance SOD model that utilizes multiple guidance mechanisms. It incorporates edge features alongside saliency features and includes self-guidance and cross-guidance modules to refine and fuse the features. The model also incorporates an accumulative guidance module and a pixelwise contrast loss function to better integrate and retain details. Experimental results show that the proposed model outperforms state-of-the-art models on benchmark datasets.

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS (2023)

添加到收藏夹

Article Computer Science, Information Systems

Boundary Information Progressive Guidance Network for Salient Object Detection

Zhaojian Yao, Luping Wang

Summary: This research focuses on the use of boundary information in saliency detection and proposes a new network structure to generate more accurate saliency maps by learning boundary features. Experimental results demonstrate that the proposed method outperforms 15 state-of-the-art methods on benchmark datasets.

IEEE TRANSACTIONS ON MULTIMEDIA (2022)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Edge-aware salient object detection network via context guidance

Xiaowei Chen, Qing Zhang, Liqian Zhang

Summary: The proposed edge-aware salient object detection network utilizes high-level semantic information to assist feature selection and locates salient objects by extracting multi-scale features and emphasizing important feature channels. It adopts a context guidance strategy to fuse high-level and low-level information and supervises the generation of low-level edge information.

IMAGE AND VISION COMPUTING (2021)

添加到收藏夹

Article Computer Science, Artificial Intelligence

LC3Net: Ladder context correlation complementary network for salient object detection

Xian Fang, Jinchao Zhu, Xiuli Shao, Hongpeng Wang

Summary: In this paper, we propose a novel network model LC(3)Net, equipped with the components of FCB, DCM, and BCD, to address the issues in utilizing contextual information. Extensive experiments demonstrate the superior performance of our method compared to 20 state-of-the-art methods.

KNOWLEDGE-BASED SYSTEMS (2022)

添加到收藏夹

Article Computer Science, Hardware & Architecture

Heatmap and edge guidance network for salient object detection

Botong Zhang, Lihua Tian, Chen Li, Yi Yang

Summary: In this paper, a novel network called HENet is proposed to achieve better prediction results by extracting and utilizing features of different layers. The feature extraction module and multi-layer feature supplementary module are used to obtain location and detailed information. Furthermore, the trisection dilated convolution module is proposed to expand the receptive field of features. Experimental results demonstrate the superiority of our method on 4 datasets.

COMPUTERS & ELECTRICAL ENGINEERING (2023)

添加到收藏夹

Article Computer Science, Artificial Intelligence

FGNet: Fixation guidance network for salient object detection

Junbin Yuan, Lifang Xiao, Kanoksak Wattanachote, Qingzhen Xu, Xiaonan Luo, Yongyi Gong

Summary: In this paper, a fixation guidance network (FGNet) is proposed for salient object detection, which utilizes fixation prediction to guide both salient object detection and edge detection. The network consists of a multi-branch structure for multi-task detection, a fixation guidance module to guide detection, and a multi-resolution feature interaction module for optimizing the representations. Experimental results show that the proposed method outperforms existing algorithms.

NEURAL COMPUTING & APPLICATIONS (2023)

添加到收藏夹

Article Automation & Control Systems

Unidirectional RGB-T salient object detection with intertwined driving of encoding and fusion

Jie Wang, Kechen Song, Yanqi Bao, Yunhui Yan, Yahong Han

Summary: This paper introduces a unidirectional RGB-T salient object detection network with intertwined driving of encoding and fusion. By using transformer as the network backbone, it solves the problem of CNNs' difficulty in establishing long-range dependencies. Furthermore, by constructing a unidirectional architecture and using local detail-driven modules, it improves the drawbacks of the encoder-decoder architecture and enhances the performance of the network.

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2022)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Aggregate interactive learning for RGB-D salient object detection

Jingyu Wu, Fuming Sun, Rui Xu, Jie Meng, Fasheng Wang

Summary: This paper proposes a strategy of aggregation and interaction to extract edge features, depth features, and salient features while maintaining local details and fully extracting global information. By extracting and fusing features in the learning process, it addresses the issues of multi-scale problem and information redundancy, achieving excellent performance in salient object detection.

EXPERT SYSTEMS WITH APPLICATIONS (2022)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Three-stream interaction decoder network for RGB-thermal salient object detection

Fushuo Huo, Xuegui Zhu, Bingheng Li

Summary: This paper proposes the Three-stream Interaction Decoder Network (TIDNet) for the RGB-T SOD task. By utilizing a three-stream interaction decoder in the encoder branches, we are able to explore saliency in depth and capture salient cues from both single and multi-modalities. Our method outperforms state-of-the-art methods in comprehensive experiments.

KNOWLEDGE-BASED SYSTEMS (2022)

添加到收藏夹

Article Engineering, Civil

TFGNet: Traffic Salient Object Detection Using a Feature Deep Interaction and Guidance Fusion

Ning Jia, Yougang Sun, Xianhui Liu

Summary: This paper proposes a traffic salient object detection method that can detect complete objects that attract human attention in natural traffic scenes, providing assistance for target recognition tasks in the domain of intelligent driving.

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS (2023)

添加到收藏夹

Article Engineering, Electrical & Electronic

Bi-Directional Progressive Guidance Network for RGB-D Salient Object Detection

Yang Yang, Qi Qin, Yongjiang Luo, Yi Liu, Qiang Zhang, Jungong Han

Summary: This paper presents a Bi-directional Progressive Guidance Network (BPGNet) for RGB-D salient object detection, which involves the qualities of RGB and depth images. The network employs a bi-directional framework based on progressive guidance strategy to extract and enhance unimodal features, in order to address the impact of RGB image quality on saliency detection.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2022)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Asymmetric cross-modal activation network for RGB-T salient object detection

Chang Xu, Qingwu Li, Qingkai Zhou, Xiongbiao Jiang, Dabing Yu, Yaqin Zhou

Summary: RGB-thermal salient object detection has unique advantages in handling challenging scenes, but existing methods often overlook the differences between imaging mechanisms and thermal image characteristics, resulting in unsatisfactory performance. To address this, an asymmetric cross-modal activation network is proposed to achieve more effective RGB-T SOD by exploiting the interactions of modality-specific features.

KNOWLEDGE-BASED SYSTEMS (2022)

添加到收藏夹

暂无数据

暂无数据

© Peeref 2019-2024. All rights reserved.