4.6 Article

3D human pose estimation from depth maps using a deep combination of poses

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.jvcir.2018.07.010

关键词

3D human pose; Body limbs; Depth maps; ConvNets

资金

  1. (ISCIII) of Spain Ministry of Economy, Industry and Competitiveness [TIN2016-75279-P, IFI16/00033]
  2. FEDER

向作者/读者索取更多资源

Many real-world applications require the estimation of human body joints for higher-level tasks as, for example, human behaviour understanding. In recent years, depth sensors have become a popular approach to obtain three-dimensional information. The depth maps generated by these sensors provide information that can be employed to disambiguate the poses observed in two-dimensional images. This work addresses the problem of 3D human pose estimation from depth maps employing a Deep Learning approach. We propose a model, named Deep Depth Pose (DDP), which receives a depth map containing a person and a set of predefined 3D prototype poses and returns the 3D position of the body joints of the person. In particular, DDP is defined as a ConvNet that computes the specific weights needed to linearly combine the prototypes for the given input. We have thoroughly evaluated DDP on the challenging 'ITOP' and 'UBC3V' datasets, which respectively depict realistic and synthetic samples, defining a new state-of-the-art on them.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Editorial Material Computer Science, Artificial Intelligence

Preface to the Special Issue on Human Pose, Motion, Activities and Shape in 3D

Manuel J. Marin-Jimenez, Javier Romero, Hao Li, Gregory Rogez

INTERNATIONAL JOURNAL OF COMPUTER VISION (2022)

Article Computer Science, Hardware & Architecture

CAVLCU: an efficient GPU-based implementation of CAVLC

Antonio Fuentes-Alventosa, Juan Gomez-Luna, Jose Maria Gonzalez-Linares, Nicolas Guil, R. Medina-Carnicer

Summary: CAVLC, a high-performance entropy method for video and image compression, is widely used in the H.264 standard. While hardware accelerators have been designed, high-performance software implementations of CAVLC, especially GPU-based ones, are limited. In this paper, a new efficient GPU-based implementation of CAVLC called CAVLCU is introduced, which outperforms existing GPU-based implementations.

JOURNAL OF SUPERCOMPUTING (2022)

Article Computer Science, Artificial Intelligence

GUD-Canny: a real-time GPU-based unsupervised and distributed Canny edge detector

Antonio Fuentes-Alventosa, Juan Gomez-Luna, R. Medina-Carnicer

Summary: The Canny algorithm is a commonly used edge detector with superior performance in noisy environments, but it suffers from a time-consuming process. To address the limitations of GPU implementations, a novel GPU-based unsupervised and distributed Canny edge detector is proposed in this paper, which achieves real-time requirements and outperforms existing GPU and FPGA implementations.

JOURNAL OF REAL-TIME IMAGE PROCESSING (2022)

Article Chemistry, Analytical

Reflection-Aware Generation and Identification of Square Marker Dictionaries

Sergio Garrido-Jurado, Juan Garrido, David Jurado-Rodriguez, Francisco Vazquez, Rafael Munoz-Salinas

Summary: Square markers are commonly used for camera localization due to their robustness, accuracy, and detection speed. However, most systems do not consider the possibility of observing reflected markers, which can lead to detection errors. This research focuses on reflection-aware square marker dictionaries and presents new algorithms for generating and identifying them. The experimental results show that the proposed approach outperforms existing dictionaries in terms of inter-marker distance and the optimization process significantly improves them.

SENSORS (2022)

Article Computer Science, Artificial Intelligence

Assessing polygonal approximations: A new measurement and a comparative study

Nicolas Luis Fernandez-Garcia, Luis Del-Moral Martinez, Angel Carmona-Poyato, Francisco Jose Madrid-Cuevas, Rafael Medina-Carnicer

Summary: This document presents two proposals regarding the evaluation of polygonal approximations. Firstly, a new measurement called normalized compression ratio and adjustment error (NCA) is proposed to provide a fair evaluation of the performance of polygonal approximations of 2D closed curves. Secondly, a new evaluation methodology based on the optimal quality curve concept is proposed for assessing the measurements. The experiments show that NCA obtains the best results and can be used to fairly evaluate the performance of polygonal approximations.

PATTERN RECOGNITION (2023)

Article Chemistry, Analytical

sSLAM: Speeded-Up Visual SLAM Mixing Artificial Markers and Temporary Keypoints

Francisco J. Romero-Ramirez, Rafael Munoz-Salinas, Manuel J. Marin-Jimenez, Miguel Cazorla, Rafael Medina-Carnicer

Summary: This paper proposes a novel visual SLAM approach that efficiently combines keypoints and artificial markers, allowing for a substantial reduction in computing time and memory required without noticeably degrading the tracking accuracy.

SENSORS (2023)

Article Computer Science, Interdisciplinary Applications

Planar fiducial markers: a comparative study

David Jurado-Rodriguez, Rafael Munoz-Salinas, Sergio Garrido-Jurado, Rafael Medina-Carnicer

Summary: This study provides a comprehensive evaluation of the most relevant marker systems, comparing them in terms of sensitivity, specificity, accuracy, computational cost, and performance under occlusion. Recommendations on which method to use based on the application requirements are offered.

VIRTUAL REALITY (2023)

Article Chemistry, Analytical

UCO Physical Rehabilitation: New Dataset and Study of Human Pose Estimation Methods on Physical Rehabilitation Exercises

Rafael Aguilar-Ortega, Rafael Berral-Soler, Isabel Jimenez-Velasco, Francisco J. Romero-Ramirez, Manuel Garcia-Marin, Jorge Zafra-Palma, Rafael Munoz-Salinas, Rafael Medina-Carnicer, Manuel J. Marin-Jimenez

Summary: This article introduces the use of deep learning for pose estimation in physical rehabilitation, aiming to help doctors monitor patients' recovery progress more effectively. The study evaluates and compares different pose estimation methods and examines the impact of subject position and camera viewpoint on the results, as well as the necessity of 3D estimation. The findings provide useful insights for optimizing rehabilitation monitoring.

SENSORS (2023)

Article Computer Science, Interdisciplinary Applications

3D model-based tracking combining edges, keypoints and fiducial markers

David Jurado-Rodriguez, Rafael Munoz-Salinas, Sergio Garrido-Jurado, Francisco J. Romero-Ramirez, Rafael Medina-Carnicer

Summary: This paper proposes a novel approach that employs an enhanced model combining edges, keypoints, and fiducial markers for robust and real-time tracking. Experimental results demonstrate that our method outperforms state-of-the-art model-based approaches and suggest that fiducial markers are a good choice for texturing models.

VIRTUAL REALITY (2023)

暂无数据