4.6 Article

3-D Deconvolutional Networks for the Unsupervised Representation Learning of Human Motions

期刊

IEEE TRANSACTIONS ON CYBERNETICS
卷 52, 期 1, 页码 398-410

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCYB.2020.2973300

关键词

Machine learning; Task analysis; Correlation; Optimization; Feature extraction; Convolution; Data models; 3-D deconvolutional networks (3DDNs); data representation; human motion analysis; unsupervised learning; video representation learning

资金

  1. National Natural Science Foundation of China [61603096, 61751202, 61751205, 61572540, U1813203, U1801262]
  2. Natural Science Foundation of Fujian Province [2017J01750]

向作者/读者索取更多资源

Data representation learning is a crucial problem in machine learning. This article proposes a new 3D deconvolutional network (3DDN) for representation learning of high-dimensional video data. The proposed 3DDN is evaluated in human action recognition and shows comparable results to feedforward convolutional neural networks (CNNs).
Data representation learning is one of the most important problems in machine learning. Unsupervised representation learning becomes meritorious as it has no necessity of label information with observed data. Due to the highly time-consuming learning of deep-learning models, there are many machine-learning models directly adapting well-trained deep models that are obtained in a supervised and end-to-end manner as feature abstractors to distinct problems. However, it is obvious that different machine-learning tasks require disparate representation of original input data. Taking human action recognition as an example, it is well known that human actions in a video sequence are 3-D signals containing both visual appearance and motion dynamics of humans and objects. Therefore, the data representation approaches with the capabilities to capture both spatial and temporal correlations in videos are meaningful. Most of the existing human motion recognition models build classifiers based on deep-learning structures such as deep convolutional networks. These models require a large quantity of training videos with annotations. Meanwhile, these supervised models cannot recognize samples from the distinct dataset without retraining. In this article, we propose a new 3-D deconvolutional network (3DDN) for representation learning of high-dimensional video data, in which the high-level features are obtained through the optimization approach. The proposed 3DDN decomposes the video frames into spatiotemporal features under a sparse constraint in an unsupervised way. In addition, it also can be regarded as a building block to develop deep architectures by stacking. The high-level representation of input sequential data can be used in multiple downstream machine-learning tasks, we evaluate the proposed 3DDN and its deep models in human action recognition. The experimental results from three datasets: 1) KTH data; 2) HMDB-51; and 3) UCF-101, demonstrate that the proposed 3DDN is an alternative approach to feedforward convolutional neural networks (CNNs), that attains comparable results.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Artificial Intelligence

Fuzzy Restricted Boltzmann Machine for the Enhancement of Deep Learning

C. L. Philip Chen, Chun-Yang Zhang, Long Chen, Min Gan

IEEE TRANSACTIONS ON FUZZY SYSTEMS (2015)

Article Green & Sustainable Science & Technology

Predictive Deep Boltzmann Machine for Multiperiod Wind Speed Forecasting

Chun-Yang Zhang, C. L. Philip Chen, Min Gan, Long Chen

IEEE TRANSACTIONS ON SUSTAINABLE ENERGY (2015)

Article Automation & Control Systems

Extraversion Measure for Crowd Trajectories

Wenxi Liu, Chun-Yang Zhang, Genggeng Liu, Yaru Su, Neil N. Xiong

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS (2019)

Article Automation & Control Systems

Dynamic Representation Learning via Recurrent Graph Neural Networks

Chun-Yang Zhang, Zhi-Liang Yao, Hong-Yu Yao, Feng Huang, C. L. Philip Chen

Summary: This article introduces a dynamic graph representation learning model called DynGNN, which embeds an RNN into a graph neural network to better capture the temporal dynamics and topological correlations of graphs and achieves significant improvements in multiple tasks.

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS (2023)

Article Engineering, Multidisciplinary

Graph Representation Learning With Adaptive Metric

Chun-Yang Zhang, Hai-Chun Cai, C. L. Philip Chen, Yue-Na Lin, Wu-Peng Fang

Summary: Contrastive learning is widely used in graph representation learning, but most existing models ignore the diversity of node attributes and network topologies. To address this, we propose a novel graph representation learning model called GRAM, which uses an adaptive metric to generate appropriate similarity scores for node pairs based on the significance of each dimension in their embedding vectors and data distribution. Experimental results demonstrate that GRAM is highly competitive in multiple tasks.

IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING (2023)

Article Computer Science, Cybernetics

Sparse Graph Transformer With Contrastive Learning

Chun-Yang Zhang, Wu-Peng Fang, Hai-Chun Cai, C. L. Philip Chen, Yue-Na Lin

Summary: This article proposes a Sparse Graph Transformer model (SGTC) for graph representation learning. It removes redundant topological information using centrality measures, learns node representations in a contrastive manner, and introduces a sparse attention mechanism to capture structural features of graphs. Experimental results demonstrate that SGTC outperforms existing baselines in terms of performance.

IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

Fuzzy Representation Learning on Graph

Chun-Yang Zhang, Yue-Na Lin, C. L. Philip Chen, Hong-Yu Yao, Hai-Chun Cai, Wu-Peng Fang

Summary: In recent years, there has been a significant increase in graph representation learning. This article proposes an unsupervised fuzzy representation learning model that improves the expressiveness of graphs and networks by making crisp representations fuzzy. The model utilizes fuzzy logic to fully explore feature-level uncertainties and generate fuzzy representations that are more expressive than crisp ones.

IEEE TRANSACTIONS ON FUZZY SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

A new robust contrastive learning for unsupervised person re-identification

Huibin Lin, Hai-Tao Fu, Chun-Yang Zhang, C. L. Philip Chen

Summary: This study proposes a new robust unsupervised person re-identification model that solves the problem of misclassification caused by clustering operations, achieving a better exploration of implicit relationships between samples.

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS (2023)

Article Automation & Control Systems

Fuzzy Representation Learning on Dynamic Graphs

Hong-Yu Yao, Yuan-Long Yu, Chun-Yang Zhang, Yue-Na Lin, Shang-Jia Li

Summary: This paper proposes a new dynamic graph representation learning model called FuzzyDGL, which incorporates fuzzy representation learning to handle uncertainties in dynamic graphs. Experimental results show that FuzzyDGL has strong competitiveness and generalization in tasks like link prediction and node classification.

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS (2023)

Article Engineering, Electrical & Electronic

STS-DGNN: Vehicle Trajectory Prediction via Dynamic Graph Neural Network With Spatial-Temporal Synchronization

Feng-Jie Li, Chun-Yang Zhang, C. L. Philip Chen

Summary: Accurate prediction of vehicle trajectories is crucial for autonomous vehicles. Existing models have limitations in capturing both spatial and temporal dependencies. Therefore, a novel dynamic graph neural network is proposed to jointly extract spatial-temporal features and consider low-order and high-order dynamics collaboratively.

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT (2023)

Article Computer Science, Artificial Intelligence

A Fuzzy Deep Model Based on Fuzzy Restricted Boltzmann Machines for High-Dimensional Data Classification

Shuang Feng, C. L. Philip Chen, Chun-Yang Zhang

IEEE TRANSACTIONS ON FUZZY SYSTEMS (2020)

暂无数据