4.7 Article

EDRNet: Encoder-Decoder Residual Network for Salient Object Detection of Strip Steel Surface Defects

Journal

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT
Volume 69, Issue 12, Pages 9709-9719

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TIM.2020.3002277

Keywords

Feature extraction; Saliency detection; Decoding; Residual neural networks; Strips; Steel; Semantics; Encoder– Decoder; residual refinement structure; salient object detection; surface defects

Funding

  1. National Natural Science Foundation of China [51805078, 51374063]
  2. National Key Research and Development Program of China [2017YFB0304200]

Ask authors/readers for more resources

It is still a challenging task to detect the surface defects of strip steel due to its complex variations, including variable defect types, cluttered background, low contrast, and noise interference. The existing detection methods cannot effectively segment the defect objects from complex background and have poor real-time performance. To address these issues, we propose a novel saliency detection method based on Encoder-Decoder Residual network (EDRNet). In the encoder stage, we use a fully convolutional neural network to extract rich multilevel defect features and fuse the attention mechanism to accelerate the convergence of the model. Then in the decoder stage, we adopt the channels weighted block (CWB) and the residual decoder block (RDB) alternatively to integrate the spatial features of shallower layers and semantic features of deep layers and recover the predicted spatial saliency values step by step. Finally, we design the residual refinement structure with 1D filters (RRS_1D) to further optimize the coarse saliency map. Compared with the existing saliency detection methods, the deeply supervised EDRNet can accurately segment the complete defect objects with well-defined boundary and effectively filter out irrelevant background noise. The extensive experimental results prove that our method is consistently superior to the state-of-the-art methods with large margins and strong robustness, and the detection efficiency is at over 27 fps on a single GPU.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Automation & Control Systems

Thermal images-aware guided early fusion network for cross-illumination RGB-T salient object detection

Han Wang, Kechen Song, Liming Huang, Hongwei Wen, Yunhui Yan

Summary: RGB-T salient object detection has achieved rapid development and excellent results in recent years. However, the current RGB-T datasets lack low-illumination data, leading to poor performance in detecting salient objects in extremely low-illumination scenes. To address this issue, we propose a T-aware guided early fusion network that leverages thermal images to enhance the detection performance of low-illumination data.

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2023)

Review Computer Science, Artificial Intelligence

Data-driven robotic visual grasping detection for unknown objects: A problem-oriented review

Hongkun Tian, Kechen Song, Song Li, Shuai Ma, Jing Xu, Yunhui Yan

Summary: This paper presents a comprehensive survey of data-driven robotic visual grasping detection (DRVGD) for unknown objects. It reviews both object-oriented and scene-oriented aspects, providing detailed information about associated grasping representations and datasets. The challenges of DRVGD and future directions are also pointed out.

EXPERT SYSTEMS WITH APPLICATIONS (2023)

Article Engineering, Multidisciplinary

FaNet: Feature-aware network for few shot classification of strip steel surface defects

Wenli Zhao, Kechen Song, Yanyan Wang, Shubo Liang, Yunhui Yan

Summary: This paper proposes a feature-aware network (FaNet) for few shot defect classification, which can effectively distinguish new classes with a small number of labeled samples. In FaNet, ResNet12 is used as the baseline, and the feature-attention convolution module (FAC) is applied to extract comprehensive feature information from the base classes. An online feature-enhance integration module (FEI) is adopted during the test phase to average the noise from defect images, further enhancing image features among different tasks. In addition, a large-scale strip steel surface defects few shot classification dataset (FSC-20) with 20 different types is constructed. Experimental results show that the proposed method achieves the best performance compared to state-of-the-art methods for 5-way 1-shot and 5-way 5-shot tasks. The dataset and code are available at: https://github.com/VDT-2048/FSC-20.

MEASUREMENT (2023)

Article Chemistry, Multidisciplinary

Study of the interaction mechanism of silver nanoparticles with γ-globulin, fibrinogen and hyaluronidase

Xiangrong Li, Zeqing Cheng, Ruonan Xu, Ziyang Wang, Li Shi, Yunhui Yan

Summary: Spherical silver nanoparticles (AgNPs) with a mean diameter of 50.4 nm were prepared using sodium citrate reduction. The interaction mechanism between AgNPs and gamma-globulin, fibrinogen, and hyaluronidase (HAase) was investigated. The results showed that AgNPs effectively quenched the intrinsic fluorescence of gamma-globulin, fibrinogen, and HAase through a static quenching mechanism. The binding constant and Hill coefficient indicated the order of interaction strength to be fibrinogen-AgNPs > gamma-globulin-AgNPs > HAase-AgNPs. The interaction between gamma-globulin/fibrinogen and AgNPs was driven by enthalpy and hydrophobic interaction, while the interaction between HAase and AgNPs was driven by entropy and van der Waals force and hydrogen bonding.

NEW JOURNAL OF CHEMISTRY (2023)

Article Computer Science, Artificial Intelligence

Informed anytime Bi-directional Fast Marching Tree for optimal motion planning in complex cluttered environments

Kuan Wang, Jing Xu, Kechen Song, Yunhui Yan, Yihang Peng

Summary: This paper proposes Informed Anytime Bi-directional Fast Marching Tree (IABFMT*), an anytime asymptotically-optimal sampling-based algorithm that combines the strengths of BFMT* and IAFMT*. It performs a bi-directional lazy search to efficiently find a feasible solution and improve it quickly. Additionally, graph pruning and heuristic cost evaluation techniques are implemented to reduce unnecessary computations and improve convergence rate. Simulation results in OMPL demonstrate the superior efficiency of IABFMT* compared to other state-of-the-art algorithms in complex cluttered environments.

EXPERT SYSTEMS WITH APPLICATIONS (2023)

Article Automation & Control Systems

A Novel Visible-Depth-Thermal Image Dataset of Salient Object Detection for Robotic Visual Perception

Kechen Song, Jie Wang, Yanqi Bao, Liming Huang, Yunhui Yan

Summary: Visual perception is crucial for industrial information field, specifically in robotic grasping application. To achieve fast and accurate object detection for grasping, salient object detection (SOD) is employed. However, existing SOD methods still have limitations in practical application due to complex interference. To address this, a novel triple-modal images fusion strategy called visible-depth-thermal (VDT) SOD is proposed. Experimental results demonstrate that our method outperforms state-of-the-art approaches.

IEEE-ASME TRANSACTIONS ON MECHATRONICS (2023)

Article Computer Science, Information Systems

Exploring the potential of Siamese network for RGBT object tracking

Liangliang Feng, Kechen Song, Junyi Wang, Yunhui Yan

Summary: Siamese tracking is a promising object tracking method that aims to improve robustness by introducing infrared data as an aid. However, current RGBT trackers have limitations in terms of operational efficiency. In this paper, an end-to-end Siamese RGBT tracking framework is proposed, which utilizes cross-modal feature enhancement and self-attention to effectively exploit the potential of Siamese tracking. The proposed framework achieved state-of-the-art performance on benchmark datasets while running in real-time.

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION (2023)

Article Computer Science, Information Systems

A Delay-Optimal Task Scheduling Strategy for Vehicle Edge Computing Based on the Multi-Agent Deep Reinforcement Learning Approach

Xuefang Nie, Yunhui Yan, Tianqing Zhou, Xingbang Chen, Dingding Zhang

Summary: Cloudlet-based vehicular networks are proposed to enhance computation services by using a distributed computation method. A parallel task scheduling strategy based on multi-agent deep reinforcement learning (DRL) approach is presented to further improve the computing efficiency and reduce the task processing delay. The experiment results demonstrate that the proposed DRL-based scheduling algorithm achieves significant performance improvement compared with traditional task scheduling algorithms.

ELECTRONICS (2023)

Article Instruments & Instrumentation

DASR: Dual-Attention Transformer for infrared image super-resolution

Shubo Liang, Kechen Song, Wenli Zhao, Song Li, Yunhui Yan

Summary: The infrared image super-resolution (SR) method improves the quality and efficiency of infrared cameras by reconstructing higher-resolution images. Existing methods overlook the specificity of infrared images and focus on small-scale factors. To address this, a novel infrared SR model called DASR is proposed, which incorporates a Transformer with spatial and channel dual-attention mechanisms to capture global edge structure information. Experimental results demonstrate that DASR outperforms state-of-the-art methods in terms of both visual quality and computational efficiency.

INFRARED PHYSICS & TECHNOLOGY (2023)

Article Engineering, Electrical & Electronic

Autocorrelation-Aware Aggregation Network for Salient Object Detection of Strip Steel Surface Defects

Wenqi Cui, Kechen Song, Hu Feng, Xiujian Jia, Shaoning Liu, Yunhui Yan

Summary: Researchers propose a novel autocorrelation-aware aggregation network (A3Net) for salient object detection of strip steel surface defects. The use of attention mechanism and scale interaction module contributes to the superior performance of the proposed method on both public and newly built datasets.

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT (2023)

Article Chemistry, Analytical

Self-Enhanced Mixed Attention Network for Three-Modal Images Few-Shot Semantic Segmentation

Kechen Song, Yiming Zhang, Yanqi Bao, Ying Zhao, Yunhui Yan

Summary: Image segmentation is an important computer vision technique that has been widely used in various tasks. In extreme cases with insufficient illumination, the performance of the model can be greatly affected, leading to the use of multi-modal images in fully supervised methods. Obtaining dense annotated large datasets is difficult, but satisfactory results can still be achieved with few-shot methods and few pixel-annotated samples. Therefore, a Visible-Depth-Thermal (three-modal) images few-shot semantic segmentation method is proposed in this study to improve the performance of few-shot segmentation tasks by utilizing the homogeneous and complementary information of three-modal images.

SENSORS (2023)

Article Engineering, Electrical & Electronic

Cross Position Aggregation Network for Few-Shot Strip Steel Surface Defect Segmentation

Hu Feng, Kechen Song, Wenqi Cui, Yiming Zhang, Yunhui Yan

Summary: This article proposes a simple and effective few-shot segmentation method called CPANet, which aims to learn a network that can segment untrained S3D categories with only a few labeled defective samples. CPANet effectively aggregates long-range relationships of discrete defects using CPP and SA modules. It also introduces an SSA module to aggregate multiscale context information of defect features and suppresses interference from background information. Extensive experiments demonstrate that CPANet achieves state-of-the-art performance on the FSSD-12 dataset.

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT (2023)

Article Computer Science, Artificial Intelligence

Feature-based domain disentanglement and randomization: A generalized framework for rail surface defect segmentation in unseen scenarios

Shuai Ma, Kechen Song, Menghui Niu, Hongkun Tian, Yanyan Wang, Yunhui Yan

Summary: This paper proposes a feature-based domain disentanglement and randomization (FDDR) framework to improve the generalization of deep models in unseen datasets. The framework successfully addresses the appearance difference issue between training and test images by decomposing the defect image into domain-invariant structural features and domain-specific style features. It also utilizes randomly generated samples for training to further expand the training sample.

ADVANCED ENGINEERING INFORMATICS (2024)

Article Engineering, Multidisciplinary

Balanced multi-scale target score network for ceramic tile surface defect detection

Tonglei Cao, Kechen Song, Likun Xu, Hu Feng, Yunhui Yan, Jingbo Guo

Summary: This study constructs a high-resolution dataset for surface defects in ceramic tiles and addresses the scale and quantity differences in defect distribution. An improved approach is proposed by introducing a content-aware feature recombination method and a dynamic attention mechanism. Experimental results demonstrate the superior accuracy and efficiency of the proposed method.

MEASUREMENT (2024)

No Data Available