4.7 Article

STDnet-ST: Spatio-temporal ConvNet for small object detection

Journal

PATTERN RECOGNITION
Volume 116, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.patcog.2021.107929

Keywords

Small object detection; Spatio-temporal convolutional network; Object linking

Funding

  1. Spanish Ministry of Science, Innovation and Universities [TIN2017-84796-C2-1-R, RTI2018-097088-B-C32]
  2. Galician Ministry of Education, Culture and Universities [ED431C 2018/29, ED431C 2017/69, ED431G/08]
  3. European Regional Development Fund (ERDF/FEDER program)

Ask authors/readers for more resources

Object detection using convolutional neural networks has achieved unprecedented levels of accuracy, but there is still room for improvement in detecting small objects. Utilizing spatial information alongside temporal video data is a new trend that can potentially enhance overall object detection performance. STDnet-ST is an end-to-end spatio-temporal convolutional neural network designed for detecting small objects in video, achieving state-of-the-art results on various video datasets.
Object detection through convolutional neural networks is reaching unprecedented levels of precision. However, a detailed analysis of the results shows that the accuracy in the detection of small objects is still far from being satisfactory. A recent trend that will likely improve the overall object detection suc-cess is to use the spatial information operating alongside temporal video information. This paper intro-duces STDnet-ST, an end-to-end spatio-temporal convolutional neural network for small object detection in video. We define small as those objects under 16 x 16 px, where the features become less distinc-tive. STDnet-ST is an architecture that detects small objects over time and correlates pairs of the top-ranked regions with the highest likelihood of containing those small objects. This permits to link the small objects across the time as tubelets. Furthermore, we propose a procedure to dismiss unprofitable object links in order to provide high quality tubelets, increasing the accuracy. STDnet-ST is evaluated on the publicly accessible USC-GRAD-STDdb, UAVDT and VisDrone2019-VID video datasets, where it achieves state-of-the-art results for small objects. (c) 2021 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available