4.6 Article

Coupled Deep Autoencoder for Single Image Super-Resolution

Journal

IEEE TRANSACTIONS ON CYBERNETICS
Volume 47, Issue 1, Pages 27-37

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCYB.2015.2501373

Keywords

Autoencoder; deep learning; single image super-resolution (SR)

Funding

  1. National Natural Science Foundation of China [61472110, 61373077]
  2. Zhejiang Provincial Natural Science Foundation of China [LR15F020002]
  3. Specialized Research Fund for the Doctoral Program of Higher Education of China [20110121110020]
  4. National Defense Basic Scientific Research Program of China [B-0110155]
  5. National Defense Science and Technology Key Laboratory Foundation [9140C-30211ZS-8]

Ask authors/readers for more resources

Sparse coding has been widely applied to learning-based single image super-resolution (SR) and has obtained promising performance by jointly learning effective representations for low-resolution (LR) and high-resolution (HR) image patch pairs. However, the resulting HR images often suffer from ringing, jaggy, and blurring artifacts due to the strong yet ad hoc assumptions that the LR image patch representation is equal to, is linear with, lies on a manifold similar to, or has the same support set as the corresponding HR image patch representation. Motivated by the success of deep learning, we develop a data-driven model coupled deep autoencoder (CDA) for single image SR. CDA is based on a new deep architecture and has high representational capability. CDA simultaneously learns the intrinsic representations of LR and HR image patches and a big-data-driven function that precisely maps these LR representations to their corresponding HR representations. Extensive experimentation demonstrates the superior effectiveness and efficiency of CDA for single image SR compared to other state-of-the-art methods on Set5 and Set14 datasets.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Computer Science, Artificial Intelligence

A Survey on Vision Transformer

Kai Han, Yunhe Wang, Hanting Chen, Xinghao Chen, Jianyuan Guo, Zhenhua Liu, Yehui Tang, An Xiao, Chunjing Xu, Yixing Xu, Zhaohui Yang, Yiman Zhang, Dacheng Tao

Summary: Transformer, a deep neural network with a self-attention mechanism, has been initially used in natural language processing and is now gaining attention in computer vision tasks. Transformer-based models perform as well as or even better than convolutional and recurrent neural networks in various visual benchmarks. This paper reviews vision transformer models, categorizes them based on different tasks, and analyzes their advantages and disadvantages. The discussed categories include backbone network, high/mid-level vision, low-level vision, and video processing. Efficient methods for applying transformer in real device-based applications are also explored. The challenges and further research directions for vision transformers are discussed as well.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Article Computer Science, Artificial Intelligence

Generalised Watson Distribution on the Hypersphere with Applications to Clustering

Stephen J. Maybank, Liu Liu, Dacheng Tao

Summary: This study defines a family of probability density functions on the unit hypersphere S-n and investigates their properties and parameter estimation methods. Various shapes of probability density functions can be obtained by adjusting the parameters. Experiments show that clustering algorithms based on Kullback-Leibler divergence can achieve good results even in high-dimensional scenarios.

JOURNAL OF MATHEMATICAL IMAGING AND VISION (2023)

Article Computer Science, Artificial Intelligence

Stochastically Controlled Compositional Gradient for Composition Problems

Liu Liu, Ji Liu, Cho-Jui Hsieh, Dacheng Tao

Summary: This article discusses the composition problems of a specific form and proposes a stochastically controlled compositional gradient algorithm that significantly reduces computational cost. The proposed method improves composition algorithms under low target accuracy, as demonstrated through theoretical proofs and experiments.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Deep Multiview Collaborative Clustering

Xu Yang, Cheng Deng, Zhiyuan Dang, Dacheng Tao

Summary: In this article, a novel multiview clustering model is proposed, which utilizes multiple autoencoder networks to embed multiview data into different latent spaces. A heterogeneous graph learning module is employed to adaptively fuse the latent representations, and intraview and interview collaborative learning are used to optimize the clustering results. Experimental results show that this method significantly outperforms other clustering approaches on multiple datasets.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

On the Guaranteed Almost Equivalence Between Imitation Learning From Observation and Demonstration

Zhihao Cheng, Liu Liu, Aishan Liu, Hao Sun, Meng Fang, Dacheng Tao

Summary: Imitation learning from observation (LfO) is more preferable than imitation learning from demonstration (LfD) due to the non-necessity of expert actions. This article proves that LfO is almost equivalent to LfD in the deterministic robot environment and even in the robot environment with bounded randomness. Extensive experiments demonstrate that LfO achieves comparable performance to LfD. This suggests that LfO can be safely applied in practice without sacrificing performance compared to LfD.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Automation & Control Systems

RSAL-iMFS: A framework of randomized stacking with active learning for incremental multi-fidelity surrogate modeling

Zongqi Liu, Xueguan Song, Chao Zhang, Yunsheng Ma, Dacheng Tao

Summary: This paper proposes a framework of randomized stacking with active learning for incremental multi-fidelity surrogate (MFS) modeling. It randomly projects the inputs of low-fidelity (LF) samples into different spaces and builds a series of LF regressors to capture the LF features. These base LF regressors are stacked to form the inputs of the subsequent incremental Gaussian process regression (iGPR) model for approximating the high-fidelity (HF) responses. The framework also adopts a query-by-committee (QBC)-based active learning method to incrementally update the current iGPR model.

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2023)

Article Computer Science, Artificial Intelligence

Hierarchical Prototype Networks for Continual Graph Representation Learning

Xikun Zhang, Dongjin Song, Dacheng Tao

Summary: Despite the progress in graph representation learning, little attention has been given to the continual learning scenario where new categories of nodes and their associated edges continuously emerge. Existing methods either ignore topological information or sacrifice stability for plasticity. To address this, the Hierarchical Prototype Networks (HPNs) extract abstract knowledge in the form of prototypes to represent expanded graphs. HPNs select relevant features and prototypes to adapt to new categories, maintaining performance over existing nodes. The experimental results show that HPNs outperform baseline techniques while consuming less memory.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Article Physics, Applied

QAOA-in-QAOA: Solving Large-Scale MaxCut Problems on Small Quantum Machines

Zeqiao Zhou, Yuxuan Du, Xinmei Tian, Dacheng Tao

Summary: The design of efficient combinatorial optimization algorithms is crucial in various fields such as logistics, finance, and chemistry. This article proposes the QAOA-in-QAOA (QAOA2) algorithm to address large-scale MaxCut problems using small quantum machines, by applying the divide-and-conquer heuristic. The performance of QAOA2 is proven to be competitive or better than classical algorithms when the node count is around 2000.

PHYSICAL REVIEW APPLIED (2023)

Article Computer Science, Artificial Intelligence

Deep Corner

Shanshan Zhao, Mingming Gong, Haimei Zhao, Jing Zhang, Dacheng Tao

Summary: Recent studies have achieved promising results by jointly learning local feature detectors and descriptors. To overcome the lack of ground-truth keypoint supervision, previous methods have incorporated relevant knowledge about keypoint attributes into the network for enhanced model learning. This paper presents Deep Corner, an end-to-end deep network that combines a local similarity-based keypoint measure with a plain convolutional network, inspired by traditional corner detectors. The proposed method yields reliable keypoints, facilitate the learning of distinctive descriptors. Additionally, the paper introduces a multi-level U-Net architecture and a feature self-transformation operation to further improve keypoint localization and descriptor invariance.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2023)

Article Automation & Control Systems

The Visual Footsteps Planning System for Exoskeleton Robots Under Complex Terrain

Xinyu Wu, Jinke Li, Liu Liu, Dacheng Tao

Summary: The lower limb power-assist exoskeletons are expected to assist paraplegic individuals in walking again. However, most exoskeletons only work in known environments, making it challenging to understand the user's intention and plan footstep sequences in unknown scenes. This study proposes a visual footstep planning system based on the Bezier curve, integrating Hololens and Realsense for environment understanding and user behavior intention recognition. Experimental results demonstrate that the planning time is reduced by 67.46% compared to traditional search algorithms, validating the effectiveness of the proposed system on a visual interaction platform.

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Learning to Purification for Unsupervised Person Re-Identification

Long Lan, Xiao Teng, Jing Zhang, Xiang Zhang, Dacheng Tao

Summary: In this study, an unsupervised person re-identification method is proposed, which has achieved great progress by training with pseudo labels. To purify the feature and label noise, multi-view features and the knowledge of a teacher model are utilized. Experimental results demonstrate the effectiveness of this approach for unsupervised person re-identification.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2023)

Article Computer Science, Artificial Intelligence

Unsupervised Structure-Adaptive Graph Contrastive Learning

Han Zhao, Xu Yang, Cheng Deng, Dacheng Tao

Summary: In this study, we propose a structure-adaptive graph contrastive learning framework to capture potential discriminative relationships for improved graph representation learning.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Geochemistry & Geophysics

Advancing Plain Vision Transformer Toward Remote Sensing Foundation Model

Di Wang, Qiming Zhang, Yufei Xu, Jing Zhang, Bo Du, Dacheng Tao, Liangpei Zhang

Summary: Large-scale vision models tailored to remote sensing tasks are proposed in this article, using Vision Transformers and a new rotated varied-size window attention mechanism. The experiments demonstrate the superior performance of the model in detection, classification, and segmentation tasks, as well as its advantages in terms of computational complexity and data efficiency.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2023)

Article Geochemistry & Geophysics

An Empirical Study of Remote Sensing Pretraining

Di Wang, Jing Zhang, Bo Du, Gui-Song Xia, Dacheng Tao

Summary: Deep learning has achieved great success in aerial image understanding for remote sensing research. However, most existing models are pretrained with ImageNet weights which hinder their fine-tuning performance on downstream aerial scene tasks due to domain gaps. This study empirically investigates RS pretraining on aerial images, training different networks from scratch using the MillionAID dataset to obtain pretrained backbones. Results show that RS pretraining enhances performance in scene recognition and RS-related semantics tasks, but task discrepancies still exist, highlighting the need for further research on large-scale pretraining datasets and effective methods.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2023)

Proceedings Paper Computer Science, Artificial Intelligence

1st Workshop on Maritime Computer Vision (MaCVi) 2023: Challenge Results

Benjamin Kiefer, Matej Kristan, Janez Pers, Lojze Zust, Fabio Poiesi, Fabio Augusto de Alcantara Andrade, Alexandre Bernardino, Matthew Dawkins, Jenni Raitoharju, Yitong Quan, Adem Atmaca, Timon Hoefer, Qiming Zhang, Yufei Xu, Jing Zhang, Dacheng Tao, Lars Sommer, Raphael Spraul, Hangyue Zhao, Hongpu Zhang, Yanyun Zhao, Jan Lukas Augustin, Eui-Ik Jeon, Impyeong Lee, Luca Zedda, Andrea Loddo, Cecilia Di Ruberto, Sagar Verma, Siddharth Gupta, Shishir Muralidhara, Niharika Hegde, Daitao Xing, Nikolaos Evangeliou, Anthony Tzes, Vojtech Bartl, Jakub Spanhel, Adam Herout, Neelanjan Bhowmik, Toby P. Breckon, Shivanand Kundargi, Tejas Anvekar, Ramesh Ashok Tabib, Uma Mudengudi, Arpita Vats, Yang Song, Delong Liu, Yonglin Li, Shuman Li, Chenhao Tan, Long Lan, Vladimir Somers, Christophe De Vleeschouwer, Alexandre Alahi, Hsiang-Wei Huang, Cheng-Yen Yang, Jenq-Neng Hwang, Pyong-Kun Kim, Kwangju Kim, Kyoungoh Lee, Shuai Jiang, Haiwen Li, Ziqiang Zheng, Tuan-Anh Vu, Hai Nguyen-Truong, Sai-Kit Yeung, Zhuang Jia, Sophia Yang, Chih-Chung Hsu, Xiu-Yu Hou, Yu-An Jhang, Simon Yang, Mau-Tsuen Yang

Summary: The 1st Workshop on Maritime Computer Vision (MaCVi) 2023 focused on maritime computer vision for UAV and USV and organized subchallenges in areas such as object detection, tracking, obstacle segmentation, and detection. The report summarizes the main findings of the subchallenges and introduces a new benchmark called SeaDronesSee Object Detection v2. Over 130 submissions were evaluated and trends in the best-performing methodologies were assessed. The datasets, evaluation code, and leaderboard are publicly available.

2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW) (2023)

No Data Available