4.7 Article

Stagewise Unsupervised Domain Adaptation With Adversarial Self-Training for Road Segmentation of Remote-Sensing Images

Journal

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TGRS.2021.3104032

Keywords

Roads; Image segmentation; Adaptation models; Task analysis; Data models; Feature extraction; Predictive models; Remote sensing (RS); road segmentation; self-training; unsupervised domain adaptation (UDA)

Funding

  1. National Natural Science Foundation of China [62076188]
  2. Science and Technology Major Project of Hubei Province (Next-Generation AI Technologies) [2019AEA170]
  3. Fundamental Research Funds for the Central Universities [2042021kf0196]
  4. supercomputing system in the Supercomputing Center of Wuhan University
  5. Australian Research Council (ARC) [FL-170100117]

Ask authors/readers for more resources

The article introduces a novel stagewise domain adaptation model RoadDA, which reduces the domain gap in road segmentation field by utilizing generative adversarial networks for interdomain adaptation and adversarial self-training, outperforming state-of-the-art methods.
Road segmentation from remote-sensing images is a challenging task with wide ranges of application potentials. Deep neural networks have advanced this field by leveraging the power of large-scale labeled data, which, however, are extremely expensive and time-consuming to acquire. One solution is to use cheap available data to train a model and deploy it to directly process the data from a specific application domain. Nevertheless, the well-known domain shift (DS) issue prevents the trained model from generalizing well on the target domain. In this article, we propose a novel stagewise domain adaptation model called RoadDA to address the DS issue in this field. In the first stage, RoadDA adapts the target domain features to align with the source ones via generative adversarial networks (GANs)-based interdomain adaptation. Specifically, a feature pyramid fusion module is devised to avoid information loss of long and thin roads and learn discriminative and robust features. Besides, to address the intradomain discrepancy in the target domain, in the second stage, we propose an adversarial self-training method. We generate the pseudo labels of the target domain using the trained generator and divide it to labeled easy split and unlabeled hard split based on the road confidence scores. The features of hard split are adapted to align with the easy ones using adversarial learning and the intradomain adaptation process is repeated to progressively improve the segmentation performance. Experiment results on two benchmarks demonstrate that RoadDA can efficiently reduce the domain gap and outperforms state-of-the-art methods. The code is available at https://github.com/LANMNG/RoadDA.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Review Computer Science, Artificial Intelligence

A Review on Generative Adversarial Networks: Algorithms, Theory, and Applications

Jie Gui, Zhenan Sun, Yonggang Wen, Dacheng Tao, Jieping Ye

Summary: In this paper, a comprehensive review of various GAN methods is provided from the perspectives of algorithms, theory, and applications. The motivations, mathematical representations, and structures of most GAN algorithms are detailed and compared. Theoretical issues related to GANs are also investigated, and the typical applications of GANs in various fields are discussed.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2023)

Article Computer Science, Information Systems

Uncertainty-Aware Clustering for Unsupervised Domain Adaptive Object Re-Identification

Pengfei Wang, Changxing Ding, Wentao Tan, Mingming Gong, Kui Jia, Dacheng Tao

Summary: Unsupervised Domain Adaptive (UDA) object re-identification (Re-ID) aims to adapt a model trained on a labeled source domain to an unlabeled target domain. To address the issue of label noise caused by clustering algorithms, we propose an uncertainty-aware clustering framework (UCF) for UDA tasks. Our UCF method consistently achieves state-of-the-art performance in multiple UDA tasks for object Re-ID and significantly reduces the performance gap between unsupervised and supervised Re-ID.

IEEE TRANSACTIONS ON MULTIMEDIA (2023)

Article Computer Science, Artificial Intelligence

A Survey on Vision Transformer

Kai Han, Yunhe Wang, Hanting Chen, Xinghao Chen, Jianyuan Guo, Zhenhua Liu, Yehui Tang, An Xiao, Chunjing Xu, Yixing Xu, Zhaohui Yang, Yiman Zhang, Dacheng Tao

Summary: Transformer, a deep neural network with a self-attention mechanism, has been initially used in natural language processing and is now gaining attention in computer vision tasks. Transformer-based models perform as well as or even better than convolutional and recurrent neural networks in various visual benchmarks. This paper reviews vision transformer models, categorizes them based on different tasks, and analyzes their advantages and disadvantages. The discussed categories include backbone network, high/mid-level vision, low-level vision, and video processing. Efficient methods for applying transformer in real device-based applications are also explored. The challenges and further research directions for vision transformers are discussed as well.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Book Review Education & Educational Research

Foundations of embodied learning: A paradigm for education

Jing Zhang

EDUCATIONAL PHILOSOPHY AND THEORY (2023)

Book Review Education & Educational Research

Bringing the neuroscience of learning to online teaching: an educator's handbook

Jing Zhang

JOURNAL OF EDUCATION FOR TEACHING (2023)

Article Computer Science, Artificial Intelligence

SIR: Self-Supervised Image Rectification via Seeing the Same Scene From Multiple Different Lenses

Jinlong Fan, Jing Zhang, Dacheng Tao

Summary: This paper proposes a novel self-supervised image rectification method based on the idea that the rectified results of distorted images from different lenses should be the same. A new network architecture is designed with a shared encoder and multiple prediction heads, and a differentiable warping module is used to generate rectified and re-distorted images. The self-supervised learning scheme achieves comparable or better performance than the supervised baseline method and state-of-the-art methods.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2023)

Article Computer Science, Artificial Intelligence

ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond

Qiming Zhang, Yufei Xu, Jing Zhang, Dacheng Tao

Summary: Vision transformers have shown promise in computer vision tasks due to their ability to model long-range dependency. However, they lack an intrinsic bias in modeling local visual structures and dealing with scale variance. This paper introduces the ViTAE transformer, which utilizes two biases and achieves superior performance on various datasets.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2023)

Article Computer Science, Artificial Intelligence

An Optimal Transport Analysis on Generalization in Deep Learning

Jingwei Zhang, Tongliang Liu, Dacheng Tao

Summary: In this article, a new analysis of generalization in deep neural networks (DNNs) is proposed from an optimal transport perspective. Upper bounds on the generalization error of learning algorithms are derived based on the algorithmic transport cost, and various conditions for loss functions are studied. The main result shows that the generalization error in DNNs decreases exponentially to zero as the number of layers increases.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Inter-layer transition in neural architecture search

Benteng Ma, Jing Zhang, Yong Xia, Dacheng Tao

Summary: Neural architecture search (NAS) is a popular research topic for identifying better architectures. Recently, differential neural architecture search methods have gained attention for their effectiveness. This paper proposes a novel inter-layer transition NAS method to investigate the dependency between edges in a network.

PATTERN RECOGNITION (2023)

Article Automation & Control Systems

Regionwise Generative Adversarial Image Inpainting for Large Missing Areas

Yuqing Ma, Xianglong Liu, Shihao Bai, Lei Wang, Aishan Liu, Dacheng Tao, Edwin R. Hancock

Summary: In this study, a generic inpainting framework is proposed to handle incomplete images with both contiguous and discontiguous large missing areas. By employing an adversarial modeling and regionwise operations, the framework is able to generate semantically reasonable and visually realistic images, outperforming existing methods on large contiguous and discontiguous missing areas, as demonstrated by qualitative and quantitative experiments.

IEEE TRANSACTIONS ON CYBERNETICS (2023)

Article Automation & Control Systems

The Visual Footsteps Planning System for Exoskeleton Robots Under Complex Terrain

Xinyu Wu, Jinke Li, Liu Liu, Dacheng Tao

Summary: The lower limb power-assist exoskeletons are expected to assist paraplegic individuals in walking again. However, most exoskeletons only work in known environments, making it challenging to understand the user's intention and plan footstep sequences in unknown scenes. This study proposes a visual footstep planning system based on the Bezier curve, integrating Hololens and Realsense for environment understanding and user behavior intention recognition. Experimental results demonstrate that the planning time is reduced by 67.46% compared to traditional search algorithms, validating the effectiveness of the proposed system on a visual interaction platform.

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Boosting Graph Contrastive Learning via Adaptive Sampling

Sheng Wan, Yibing Zhan, Shuo Chen, Shirui Pan, Jian Yang, Dacheng Tao, Chen Gong

Summary: Contrastive learning is a key technique for self-supervised representation learning, but the uniform negative sampling strategy limits the expressive power of contrastive models. To address this, the article proposes an adaptive sampling strategy called AdaS and introduces an auxiliary polarization regularizer to improve the performance of graph contrastive learning.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

IC9600: A Benchmark Dataset for Automatic Image Complexity Assessment

Tinglei Feng, Yingjie Zhai, Jufeng Yang, Jie Liang, Deng-Ping Fan, Jing Zhang, Ling Shao, Dacheng Tao

Summary: Image complexity is an important visual perception for humans to understand an image. Evaluating image complexity is challenging due to its subjective nature and the diversity of real-world images. To address this, we have created a large-scale dataset with 9,600 well-annotated images and developed a base model to predict complexity scores and density maps. The model is effective and correlates well with human perception. Additionally, exploring image complexity can enhance the performance of computer vision tasks. The dataset and source code can be found at https://github.com/tinglyfeng/IC9600.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Article Computer Science, Artificial Intelligence

Learning to Purification for Unsupervised Person Re-Identification

Long Lan, Xiao Teng, Jing Zhang, Xiang Zhang, Dacheng Tao

Summary: In this study, an unsupervised person re-identification method is proposed, which has achieved great progress by training with pseudo labels. To purify the feature and label noise, multi-view features and the knowledge of a teacher model are utilized. Experimental results demonstrate the effectiveness of this approach for unsupervised person re-identification.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2023)

Article Computer Science, Artificial Intelligence

Curvature Consistent Network for Microscope Chip Image Super-Resolution

Mingjin Zhang, Jingwei Xin, Jing Zhang, Dacheng Tao, Xinbo Gao

Summary: This article addresses the problem of detecting hardware Trojan from microscope chip images. It proposes a novel MCI super-resolution method using a curvature consistent network, which can recover more delicate circuit lines and improve HT detection performance. Experiments on a new benchmark dataset called MCI demonstrate the superiority of the proposed method over representative SR methods. The MCI dataset is available on GitHub.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

No Data Available