4.7 Article

Semantic Context-Aware Network for Multiscale Object Detection in Remote Sensing Images

Journal

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/LGRS.2021.3067313

Keywords

Semantics; Feature extraction; Remote sensing; Object detection; Kernel; Proposals; Pipelines; Multiscale object; receptive field; remote sensing images; semantic context

Funding

  1. National Natural Science Foundation of China [61976179]
  2. Fundamental Research Funds for the Central Universities [3102019HTXM005, 3102017HQZZ003]
  3. Key Industrial Innovation Chain Project in the Industrial Domain of Key Research and Development Program of Shaanxi Province [2018ZDCXLGY030203]

Ask authors/readers for more resources

This paper proposes a semantic context-aware network (SCANet) model for multiscale object detection, with the use of receptive field-enhancement module (RFEM) and semantic context fusion module (SCFM) to enhance performance. Experimental results show that SCANet achieves superior detection results on the DOTA-v1.5 dataset compared to state-of-the-art approaches.
Accurate object detection in remote sensing images is an essential part of automatic extraction, analysis, and understanding of image information, which potentially plays a significant role in a number of practical applications. However, the scale diversity in remote sensing images presents a substantial challenge for object detection, regarded as one of the crucial problems to be solved. To extract multiscale feature representations and sufficiently exploit semantic context information, this letter proposes a semantic context-aware network (SCANet) model for multiscale object detection. We propose two novel modules, called receptive field-enhancement module (RFEM) and semantic context fusion module (SCFM), to enhance the performance of SCANet. The RFEM dedicates to more robust multiscale feature extraction by paying attention to distinct receptive fields through multibranch different convolutions. For the purpose of utilizing the semantic context information contained in the scene to guide the network to better detection accuracy, the SCFM integrates the semantic context features from the upper level with the lower level features and delivers them hierarchically. Experiments demonstrate that, compared with the state-of-the-art approaches, the SCANet yields superior detection results on the DOTA-v1.5 data set.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available