☆ 4.8 Article

Real-Time Simultaneous Pose and Shape Estimation for Articulated Objects Using a Single Depth Camera

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2016)

期刊

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE

卷 38, 期 8, 页码 1517-1532

出版社

IEEE COMPUTER SOC

DOI: 10.1109/TPAMI.2016.2557783

关键词

Generative pose tracking; shape registration; real-time tracking; motion; depth cues; range data; surface fitting

类别

Computer Science, Artificial Intelligence Engineering, Electrical & Electronic

资金

Direct For Computer & Info Scie & Enginr [1231545] Funding Source: National Science Foundation
Directorate For Engineering [1543172] Funding Source: National Science Foundation
Div Of Industrial Innovation & Partnersh [1543172] Funding Source: National Science Foundation
Div Of Information & Intelligent Systems [1231545] Funding Source: National Science Foundation

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

In this paper we present a novel real-time algorithm for simultaneous pose and shape estimation for articulated objects, such as human beings and animals. The key of our pose estimation component is to embed the articulated deformation model with exponential-maps-based parametrization into a Gaussian Mixture Model. Benefiting from this probabilistic measurement model, our algorithm requires no explicit point correspondences as opposed to most existing methods. Consequently, our approach is less sensitive to local minimum and handles fast and complex motions well. Moreover, our novel shape adaptation algorithm based on the same probabilistic model automatically captures the shape of the subjects during the dynamic pose estimation process. The personalized shape model in turn improves the tracking accuracy. Furthermore, we propose novel approaches to use either a mesh model or a sphere-set model as the template for both pose and shape estimation under this unified framework. Extensive evaluations on publicly available data sets demonstrate that our method outperforms most state-of-the-art pose estimation algorithms with large margin, especially in the case of challenging motions. Furthermore, our shape estimation method achieves comparable accuracy with state of the arts, yet requires neither statistical shape model nor extra calibration procedure. Our algorithm is not only accurate but also fast, we have implemented the entire processing pipeline on GPU. It can achieve up to 60 frames per second on a middle-range graphics card.

作者

我是这篇论文的作者

点击您的名字以认领此论文并将其添加到您的个人资料中。

主要评分

4.8

评分不足

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

Comparison of Graph Fitting and Sparse Deep Learning Model for Robot Pose Estimation

Jan Rodziewicz-Bielewicz, Marcin Korze

Summary: This paper presents a simple yet robust computer vision system for robot arm tracking using RGB-D cameras. It tracks the robot's state in real time, given three angles and known restrictions about the robot's geometry. The system consists of two parts: image preprocessing and machine learning. In the machine learning part, two approaches are compared: fitting the robot pose to the point cloud and fitting the convolutional neural network model to the sparse 3D depth images. The presented approach directly uses the point cloud transformed to the sparse image in the network input, utilizing sparse CNN layers. Experiments confirm real-time robot tracking with accuracy comparable to that of the depth sensor.

SENSORS (2022)