Learning Latent Representations of 3D Human Pose with Deep Neural Networks - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue International Journal of Computer Vision Année : 2018

Learning Latent Representations of 3D Human Pose with Deep Neural Networks

Résumé

Most recent approaches to monocular 3D pose estimation rely on Deep Learning. They either train a Convo-lutional Neural Network to directly regress from an image to a 3D pose, which ignores the dependencies between human joints, or model these dependencies via a max-margin structured learning framework, which involves a high computational cost at inference time. In this paper, we introduce a Deep Learning regression architecture for structured prediction of 3D human pose from monocular images or 2D joint location heatmaps that relies on an overcomplete autoencoder to learn a high-dimensional latent pose representation and accounts for joint dependencies. We further propose an efficient Long Short-Term Memory (LSTM) network to enforce temporal consistency on 3D pose predictions. We demonstrate that our approach achieves state-of-the-art performance both in terms of structure preservation and prediction accuracy on standard 3D human pose estimation benchmarks.
Fichier principal
Vignette du fichier
ijcv18.pdf (3.93 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02509358 , version 1 (17-03-2020)

Identifiants

Citer

Isinsu Katircioglu, Bugra Tekin, Mathieu Salzmann, Vincent Lepetit, Pascal Fua. Learning Latent Representations of 3D Human Pose with Deep Neural Networks. International Journal of Computer Vision, 2018, 126 (12), pp.1326-1341. ⟨10.1007/s11263-018-1066-6⟩. ⟨hal-02509358⟩

Collections

CNRS
147 Consultations
178 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More