4.7 Article

Object Detection in 20 Years: A Survey

期刊

PROCEEDINGS OF THE IEEE
卷 111, 期 3, 页码 257-276

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JPROC.2023.3238524

关键词

Object detection; Detectors; Computer vision; Feature extraction; Deep learning; Convolutional neural networks; convolutional neural networks (CNNs); deep learning; object detection; technical evolution

向作者/读者索取更多资源

Object detection, a fundamental problem in computer vision, has received significant attention in recent years. This article reviews the rapid technological evolution of object detection over the past two decades and its impact on the entire computer vision field. It covers various topics such as milestone detectors, datasets, metrics, fundamental building blocks, speedup techniques, and state-of-the-art methods.
Object detection, as of one the most fundamental and challenging problems in computer vision, has received great attention in recent years. Over the past two decades, we have seen a rapid technological evolution of object detection and its profound impact on the entire computer vision field. If we consider today's object detection technique as a revolution driven by deep learning, then, back in the 1990s, we would see the ingenious thinking and long-term perspective design of early computer vision. This article extensively reviews this fast-moving research field in the light of technical evolution, spanning over a quarter-century's time (from the 1990s to 2022). A number of topics have been covered in this article, including the milestone detectors in history, detection datasets, metrics, fundamental building blocks of the detection system, speedup techniques, and recent state-of-the-art detection methods.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Geochemistry & Geophysics

High-Resolution Remote Sensing Image Captioning Based on Structured Attention

Rui Zhao, Zhenwei Shi, Zhengxia Zou

Summary: An attention-based method that generates pixelwise semantic content segmentation masks for remote sensing images is proposed, achieving higher captioning accuracy compared to state-of-the-art methods. This method utilizes fine-grained, structured attention to exploit the structural characteristics of semantic contents in high-resolution remote sensing images.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2022)

Article Computer Science, Artificial Intelligence

Neural Rendering for Game Character Auto-Creation

Tianyang Shi, Zhengxia Zou, Zhenwei Shi, Yi Yuan

Summary: This paper proposes a new method to automatically create game characters based on a single face photo, utilizing a imitator network and gradient descent for optimization. Unlike previous methods, our approach produces fine-grained facial parameters with physical significance, allowing users to fine-tune their characters.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

Article Engineering, Civil

Design, Implementation, and Evaluation of a Roadside Cooperative Perception System

Rusheng Zhang, Zhengxia Zou, Shengyin Shen, Henry X. Liu

Summary: This paper introduces a newly developed and deployed roadside cooperative perception system with an edge-cloud structure and multiple kinds of sensors. The performance of the system is analyzed using data collected from the field, and its potential in applications such as traffic volume monitoring and road safety warning is demonstrated.

TRANSPORTATION RESEARCH RECORD (2022)

Article Geochemistry & Geophysics

Resolution-Agnostic Remote Sensing Scene Classification With Implicit Neural Representations

Keyan Chen, Wenyuan Li, Jianqi Chen, Zhengxia Zou, Zhenwei Shi

Summary: Remote sensing scene classification is a challenging task, but recent advances in convolutional neural networks (CNNs) have improved accuracy. However, handling resolution variations in remote sensing images is still challenging for CNNs. In this letter, we propose a novel scene classification method that leverages implicit neural representations (INRs) to adapt to scale and resolution changes. By converting the images into continuous functions and performing classification within the function space, our method achieves comparable accuracy to deep CNNs while exhibiting better adaptability to resolution changes.

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS (2023)

Article Geochemistry & Geophysics

Hyperspectral Remote Sensing Image Synthesis Based on Implicit Neural Spectral Mixing Models

Liqin Liu, Zhengxia Zou, Zhenwei Shi

Summary: This article introduces a novel method for modeling nonlinear spectral mixtures based on implicit neural representations (INRs), and proposes a new method for hyperspectral image (HSI) synthesis. The proposed method adaptively implements different mixture models for each pixel according to their spectral signature and surrounding environment, producing accurate and physically meaningful HSIs. It also generates subpixel-level spectral abundance and the solar atmosphere signature as by-products.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2023)

Article Engineering, Environmental

Sunlight Significantly Enhances Soil Denitrification via an Interfacial Biophotoelectrochemical Pathway

Shaofu Huang, Keyan Chen, Xiangyu Chen, Hanpeng Liao, Raymond Jianxiong Zeng, Shungui Zhou, Man Chen

Summary: This work demonstrated a biophotoelectrochemical pathway that enhances soil denitrification by directly exciting soil. The investigation of sunlight's influence on soil denitrification has mainly focused on plant photosynthesis-mediated processes. However, this study discovered a new pathway, the biophotoelectrochemical process, which greatly accelerates soil denitrification by generating more electron sources through photoinduced ferrous substrates and photoelectrons.

ENVIRONMENTAL SCIENCE & TECHNOLOGY (2023)

Article Geochemistry & Geophysics

Continuous Remote Sensing Image Super-Resolution Based on Context Interaction in Implicit Function Space

Keyan Chen, Wenyuan Li, Sen Lei, Jianqi Chen, Xiaolong Jiang, Zhengxia Zou, Zhenwei Shi

Summary: A highly applicable super-resolution (SR) framework called FunSR is proposed, which handles different magnifications with a unified model. FunSR shows state-of-the-art performance on both fixed- and continuous-magnification settings and provides many friendly applications due to its unified nature.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2023)

Article Geochemistry & Geophysics

Continuous Cross-Resolution Remote Sensing Image Change Detection

Hao Chen, Haotian Zhang, Keyan Chen, Chenyao Zhou, Song Chen, Zhengxia Zou, Zhenwei Shi

Summary: In this paper, a method for change detection in remote sensing images with different resolutions is proposed. The model is trained using synthesized samples to consistently predict high-resolution results for varying resolution differences. Experimental results demonstrate that the proposed method outperforms other methods on multiple datasets.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2023)

Article Geochemistry & Geophysics

A Decoupling Paradigm With Prompt Learning for Remote Sensing Image Change Captioning

Chenyang Liu, Rui Zhao, Jianqi Chen, Zipeng Qi, Zhengxia Zou, Zhenwei Shi

Summary: Remote sensing image change captioning (RSICC) is a novel task that aims to describe the differences between bitemporal images by natural language. This article decouples the task into two issues and utilizes prompt learning to generate captions with pretrained large language models (LLMs). Extensive experiments show that the method is effective and achieves state-of-the-art performance.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2023)

Article Geochemistry & Geophysics

Contrastive Learning for Fine-Grained Ship Classification in Remote Sensing Images

Jianqi Chen, Keyan Chen, Hao Chen, Wenyuan Li, Zhengxia Zou, Zhenwei Shi

Summary: The article introduces an asynchronous contrastive learning-based method for effective fine-grained ship classification in remote sensing images. The method, called Push-and-Pull Network (P(2)Net), separates images using a dual-branch network and aggregates them into subclasses using an integration module. Experimental results demonstrate the effectiveness of the proposed method.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2022)

Article Geochemistry & Geophysics

Physics-Informed Hyperspectral Remote Sensing Image Synthesis With Deep Conditional Generative Adversarial Networks

Liqin Liu, Wenyuan Li, Zhenwei Shi, Zhengxia Zou

Summary: This article proposes a deep learning-based method for generating high-resolution hyperspectral images and subpixel ground-truth annotations from RGB images. The method takes into consideration the imaging mechanism and spectral mixing, and employs a deep generative network along with a spectral library to generate synthetic spectral data. The addition of spatial and spectral discriminative networks improves the accuracy of the synthetic output.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2022)

暂无数据