Learning Latent Representations of 3D Human Pose with Deep Neural Networks

Isinsu Katircioglu; Bugra Tekin; Mathieu Salzmann; Vincent Lepetit; Pascal Fua

doi:10.1007/s11263-018-1066-6

Article Dans Une Revue International Journal of Computer Vision Année : 2018

Learning Latent Representations of 3D Human Pose with Deep Neural Networks

(1) , (1) , (1) , (2) , (1)

1
2

Isinsu Katircioglu

Fonction : Auteur

Ecole Polytechnique Fédérale de Lausanne

Bugra Tekin

Fonction : Auteur

Ecole Polytechnique Fédérale de Lausanne

Mathieu Salzmann

Fonction : Auteur

Ecole Polytechnique Fédérale de Lausanne

Vincent Lepetit

Fonction : Auteur
PersonId : 181024
IdHAL : vincent-lepetit
ORCID : 0000-0001-9985-4433
IdRef : 152965343

Laboratoire Bordelais de Recherche en Informatique

Pascal Fua

Fonction : Auteur

Ecole Polytechnique Fédérale de Lausanne

Résumé

Most recent approaches to monocular 3D pose estimation rely on Deep Learning. They either train a Convo-lutional Neural Network to directly regress from an image to a 3D pose, which ignores the dependencies between human joints, or model these dependencies via a max-margin structured learning framework, which involves a high computational cost at inference time. In this paper, we introduce a Deep Learning regression architecture for structured prediction of 3D human pose from monocular images or 2D joint location heatmaps that relies on an overcomplete autoencoder to learn a high-dimensional latent pose representation and accounts for joint dependencies. We further propose an efficient Long Short-Term Memory (LSTM) network to enforce temporal consistency on 3D pose predictions. We demonstrate that our approach achieves state-of-the-art performance both in terms of structure preservation and prediction accuracy on standard 3D human pose estimation benchmarks.

Domaines

Intelligence artificielle [cs.AI]

Fichier principal

ijcv18.pdf (3.93 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Vincent Lepetit : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02509358

Soumis le : mardi 17 mars 2020-14:57:26

Dernière modification le : vendredi 24 mars 2023-14:53:15

Dates et versions

hal-02509358 , version 1 (17-03-2020)

Identifiants

HAL Id : hal-02509358 , version 1
DOI : 10.1007/s11263-018-1066-6

Citer

Isinsu Katircioglu, Bugra Tekin, Mathieu Salzmann, Vincent Lepetit, Pascal Fua. Learning Latent Representations of 3D Human Pose with Deep Neural Networks. International Journal of Computer Vision, 2018, 126 (12), pp.1326-1341. ⟨10.1007/s11263-018-1066-6⟩. ⟨hal-02509358⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS

147 Consultations

178 Téléchargements

Learning Latent Representations of 3D Human Pose with Deep Neural Networks

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager