☆ 4.7 Article

Spatio-Temporal Manifold Learning for Human Motions via Long-Horizon Modeling

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS (2021)

期刊

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS

卷 27, 期 1, 页码 216-227

出版社

IEEE COMPUTER SOC

DOI: 10.1109/TVCG.2019.2936810

关键词

Manifolds; Deep learning; Skeleton; Three-dimensional displays; Feature extraction; Dynamics; Animation; Computer graphics; computer animation; character animation; deep learning

类别

Computer Science, Software Engineering

资金

EPSRC [EP/R031193/1]
NVIDIA Corporation
Royal Society [IESnR2n181024]
EPSRC [EP/R031193/1] Funding Source: UKRI

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Data-driven modeling of human motions is common in computer graphics and computer vision applications, recent research indicates that using deep learning for natural motion manifold learning can address traditional methods' shortcomings. Traditional deep learning methods have issues with under-utilizing skeletal information for feature extraction and handling the multi-modal temporal correlations in motion data.

Data-driven modeling of human motions is ubiquitous in computer graphics and computer vision applications, such as synthesizing realistic motions or recognizing actions. Recent research has shown that such problems can be approached by learning a natural motion manifold using deep learning on a large amount data, to address the shortcomings of traditional data-driven approaches. However, previous deep learning methods can be sub-optimal for two reasons. First, the skeletal information has not been fully utilized for feature extraction. Unlike images, it is difficult to define spatial proximity in skeletal motions in the way that deep networks can be applied for feature extraction. Second, motion is time-series data with strong multi-modal temporal correlations between frames. On the one hand, a frame could be followed by several candidate frames leading to different motions; on the other hand, long-range dependencies exist where a number of frames in the beginning are correlated with a number of frames later. Ineffective temporal modeling would either under-estimate the multi-modality and variance, resulting in featureless mean motion or over-estimate them resulting in jittery motions, which is a major source of visual artifacts. In this paper, we propose a new deep network to tackle these challenges by creating a natural motion manifold that is versatile for many applications. The network has a new spatial component for feature extraction. It is also equipped with a new batch prediction model that predicts a large number of frames at once, such that long-term temporally-based objective functions can be employed to correctly learn the motion multi-modality and variances. With our system, long-duration motions can be predicted/synthesized using an open-loop setup where the motion retains the dynamics accurately. It can also be used for denoising corrupted motions and synthesizing new motions with given control signals. We demonstrate that our system can create superior results comparing to existing work in multiple applications.

Spatio-Temporal Manifold Learning for Human Motions via Long-Horizon Modeling

期刊

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS

出版社

IEEE COMPUTER SOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Spatio-Temporal Manifold Learning for Human Motions via Long-Horizon Modeling

期刊

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS

出版社

IEEE COMPUTER SOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文