4.7 Article

Stepwise Domain Adaptation (SDA) for Object Detection in Autonomous Vehicles Using an Adaptive CenterNet

期刊

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TITS.2022.3164407

关键词

Object detection; Detectors; Training; Deep learning; Feature extraction; Adaptive systems; Task analysis; Autonomous vehicles; advanced driver assistance systems (ADASs); object detection; deep learning; domain adaptation; adversarial learning

资金

  1. National Science Foundation (NSF), China [51805332]
  2. Shenzhen Fundamental Research Fund [JCYJ20190808142613246, 20200803015912001]

向作者/读者索取更多资源

In this paper, a stepwise domain adaptation (SDA) detection method is proposed for cross-domain object detection tasks. The method addresses the domain shift by training an unpaired image-to-image translator to bridge the domain gap, and by using an adaptive CenterNet to minimize the divergence across domains. The results show that the proposed method outperforms existing approaches in object detection scenarios with domain shift.
In recent years, deep learning technologies for object detection have made great progress and have powered the emergence of state-of-the-art models to address object detection problems. Since the domain shift can make detectors unstable or even crash, the detection of cross-domain becomes very important for the design of object detectors. However, traditional deep learning technologies for object detection always rely on a large amount of reliable ground-truth labelling that is laborious, costly, and time-consuming. Although an advanced approach CycleGAN has been proposed for cross-domain object detection tasks, the ability of CycleGAN to reduce the divergence across domains at the feature level is limited. In this paper, a stepwise domain adaptation (SDA) detection method is proposed to further improve the performance of CycleGAN by minimizing the divergence in cross-domain object detection tasks. Specifically, the domain shift is addressed in two steps. In the first step, to bridge the domain gap, an unpaired image-to-image translator is trained to construct a fake target domain by translating the source images to the similar ones in the target domain. In the second step, to further minimize divergence across domains, an adaptive CenterNet is designed to align distributions at the feature level in an adversarial learning manner. Our proposed method is evaluated in domain shift scenarios based on the driving datasets including Cityscapes, Foggy Cityscapes, SIM10k, and BDD100K. The results show that our method is superior to the state-of-the-art methods and is effective for object detection in domain shift scenarios.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Engineering, Biomedical

Evaluation of force-time curve analysis methods in the isometric mid-thigh pull test

Junshi Liu, Xingda Qu, Michael H. Stone

SPORTS BIOMECHANICS (2023)

Article Psychology, Developmental

Characteristics of Visual Fixation in Chinese Children with Autism During Face-to-Face Conversations

Zhong Zhao, Haiming Tang, Xiaobin Zhang, Zhipeng Zhu, Jiayi Xing, Wenzhou Li, Da Tao, Xingda Qu, Jianping Lu

Summary: This study used an eye tracker to record the gaze behavior of Chinese children with ASD and children with typical development during a conversation. The results showed that children with ASD paid less attention to the interlocutor's mouth and whole-face and more to the background. Moreover, gaze behavior varied with the conversational topic.

JOURNAL OF AUTISM AND DEVELOPMENTAL DISORDERS (2023)

Review Engineering, Electrical & Electronic

Review of Clustering Technology and Its Application in Coordinating Vehicle Subsystems

Caizhi Zhang, Weifeng Huang, Tong Niu, Zhitao Liu, Guofa Li, Dongpu Cao

Summary: Clustering is an unsupervised learning technology that groups information based on similarity measures. This paper introduces the concept of clustering and analyzes clustering technologies from traditional and modern perspectives. It summarizes the principles, advantages, and disadvantages of various traditional and modern clustering algorithms, and presents core elements such as similarity measures and evaluation index. The paper also lists specific applications of clustering algorithms in vehicles and highlights the future development of clustering in the era of big data.

AUTOMOTIVE INNOVATION (2023)

Article Engineering, Industrial

Input modality matters: A comparison of touch, speech, and gesture based in-vehicle interaction

Tingru Zhang, Xing Liu, Weisheng Zeng, Da Tao, Guofa Li, Xingda Qu

Summary: This study compared the effects of three novel input modalities (touchscreen, speech-based, and gesture-based) on driving performance and driver visual behaviors. The results showed that touchscreen interaction had a negative impact on driving performance, while gesture-based interaction had a smaller but still significant crash risk. Speech-based interaction had the least influence on driving and visual performance. The effects of different modalities were robust across different non-driving related tasks (NDRTs).

APPLIED ERGONOMICS (2023)

Article Engineering, Electrical & Electronic

Driver Vigilance Detection Based on Limited EEG Signals

Guofa Li, Long Zhang, Ying Zou, Delin Ouyang, Yufei Yuan, Qiuyan Lian, Wenbo Chu, Gang Guo

Summary: This article examines the potential of using EEG signals from only one frequency band or from only a small subset of related electrode channels in recognizing driver vigilance state. The experimental results show that the recognition accuracy is higher when using EEG signals from the selected frequency band (i.e., alpha band) or the selected electrodes (i.e., T7, TP7, and CP1) than when using all the data. These results indicate that higher driver vigilance recognition accuracy can be achieved with much less amount of data, which would facilitate the development of wearable equipment based on EEG signals.

IEEE SENSORS JOURNAL (2023)

Review Engineering, Electrical & Electronic

Sensing and Machine Learning for Automotive Perception: A Review

Ashish Pandharipande, Chih-Hong Cheng, Justin Dauwels, Sevgi Z. Gurbuz, Javier Ibanez-Guzman, Guofa Li, Andrea Piazzoni, Pu Wang, Avik Santra

Summary: Automotive perception is crucial for achieving high levels of safety and autonomy in driving, involving the understanding of the external driving environment and the internal state of the vehicle cabin and occupants using sensor data. This article provides an overview of different sensor modalities commonly used for perception, along with data processing techniques. Critical aspects such as architectures, algorithms, and safety are discussed, with a focus on machine learning approaches. Future research opportunities in automotive perception are also outlined.

IEEE SENSORS JOURNAL (2023)

Article Computer Science, Artificial Intelligence

Driver Behavioral Cloning for Route Following in Autonomous Vehicles Using Task Knowledge Distillation

Guofa Li, Zefeng Ji, Shen Li, Xiao Luo, Xingda Qu

Summary: This paper proposes a new off-policy imitation learning method for autonomous driving by using task knowledge distillation, which overcomes the dependence on large scales of time-consuming, laborious, and reliable labels in existing behavioral cloning methods. The experiment results show that our method can achieve satisfactory route-following performance in realistic urban driving scenes and can transfer the driving strategies to new unknown scenes under various illumination and weather scenarios for autonomous driving.

IEEE TRANSACTIONS ON INTELLIGENT VEHICLES (2023)

Article Computer Science, Artificial Intelligence

Lane Change Strategies for Autonomous Vehicles: A Deep Reinforcement Learning Approach Based on Transformer

Guofa Li, Yifan Qiu, Yifan Yang, Zhenning Li, Shen Li, Wenbo Chu, Paul Green, Shengbo Eben Li

Summary: End-to-end approaches are a promising solution for AV decision-making, but their deployment is often hindered by high computational burden. To address this, we propose a lightweight transformer-based end-to-end model with risk awareness for AV decision-making. We introduce a lightweight network with depth-wise separable convolution and transformer modules to extract image semantics from trajectory data. We then assess driving risk using a probabilistic model with position uncertainty and integrate it into deep reinforcement learning to find strategies with minimum expected risk. The proposed method is evaluated in three lane change scenarios to validate its superiority.

IEEE TRANSACTIONS ON INTELLIGENT VEHICLES (2023)

Article Engineering, Electrical & Electronic

Depth Estimation Based on Monocular Camera Sensors in Autonomous Vehicles: A Self-supervised Learning Approach

Guofa Li, Xingyu Chi, Xingda Qu

Summary: This paper proposes an improved bidirectional feature pyramid module (BiFPN) and a channel attention module (Seblock) to address issues in existing methods based on monocular camera sensor. The improved BiFPN facilitates efficient fusion of multi-scale features, while Seblock enhances useful information by redistributing channel feature weights. Experimental results demonstrate that this method is competitive with state-of-the-art algorithms and preserves fine-grained texture of scene depth.

AUTOMOTIVE INNOVATION (2023)

Article Engineering, Electrical & Electronic

Pedestrian Tracking Based on Receptive Field Improvement: A One-Shot Multiobject Tracking Approach Based on Vision Sensors

Guofa Li, Delin Ouyang, Xin Chen, Wenbo Chu, Bing Lu, Caizhi Zhang, Xiaolin Tang, Gang Guo

Summary: This article introduces a reinforced one-shot multiobject tracking system that utilizes new methods to improve tracking accuracy. The article first presents a receptive field module to address the problem of target scale transformation, then designs an attention mechanism network to extract channel and positional information, and finally combines circle loss with Euclidean distance optimization and cross-entropy loss to enhance the learning of discriminative embeddings.

IEEE SENSORS JOURNAL (2023)

Article Engineering, Electrical & Electronic

Key Supplement: Improving 3-D Car Detection With Pseudo Point Cloud

Guofa Li, Junda Li, Cong Wang, Qianlei Peng, Caizhi Zhang, Feng Gao, Xiaolin Tang, Gang Guo

Summary: In the development of autonomous driving technologies, 3-D car detection based on LiDAR has been recognized as a key research topic. However, LiDAR-based methods are limited by the incomplete point cloud problem caused by signal missing and object occlusion. To tackle this challenge, we introduce a novel early fusion method called key supplement, which supplements the key area with pseudo point cloud generated by images to address the incomplete point cloud problem. We also propose a multimodal method to reconstruct the incomplete point cloud. Experimental results demonstrate that our proposed key supplement method improves the performances of state-of-the-art LiDAR-based methods.

IEEE SENSORS JOURNAL (2023)

Article Engineering, Civil

SOSMaskFuse: An Infrared and Visible Image Fusion Architecture Based on Salient Object Segmentation Mask

Guofa Li, Xuanhu Qian, Xingda Qu

Summary: High-quality fusion images with infrared and visible information play a crucial role in intelligent and safe driving. To address the issue of unclear fusion images due to noise in the infrared image and loss of texture information from the visible image, we propose a novel two-stage network called SOSMaskFuse. The experimental results demonstrate that our proposed network outperforms other algorithms in terms of quality and effectiveness.

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS (2023)

Article Engineering, Civil

Latent Hazard Notification for Highly Automated Driving: Expected Safety Benefits and Driver Behavioral Adaptation

Qingkun Li, Yizi Su, Wenjun Wang, Zhenyuan Wang, Jibo He, Guofa Li, Chao Zeng, Bo Cheng

Summary: Although latent hazard notification for highly automated driving is expected to enhance traffic safety, its practical effects have yet to be verified. This study investigated the expected safety benefits and driver behavioral adaptation based on structural equation modeling. The findings reveal that while latent hazard notification significantly improves driver attention and enhances traffic safety, it also increases driver trust and impairs traffic safety due to driver behavioral adaptation.

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

A Spontaneous Driver Emotion Facial Expression (DEFE) Dataset for Intelligent Vehicles: Emotions Triggered by Video-Audio Clips in Driving Scenarios

Wenbo Li, Yaodong Cui, Yintao Ma, Xingxin Chen, Guofa Li, Guanzhong Zeng, Gang Guo, Dongpu Cao

Summary: This article introduces a new dataset, the driver emotion facial expression (DEFE) dataset, for analyzing drivers' spontaneous emotions. The dataset consists of facial expression recordings of 60 participants during driving. Each participant completed driving tasks in the same scenario and rated their emotional responses using the dimensional emotion method and discrete emotion method. Classification experiments were conducted to recognize arousal, valence, dominance scales, emotion category, and intensity as baseline results. Additionally, the article compares emotion recognition results between dynamic driving and static life scenarios through facial expressions.

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING (2023)

暂无数据