Learning Latent Representations of 3D Human Pose with Deep Neural Networks

[INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI] Structured prediction 0202 electrical engineering, electronic engineering, information engineering Deep learning 02 engineering and technology 3D human pose estimation [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
DOI: 10.1007/s11263-018-1066-6 Publication Date: 2018-01-31T12:07:17Z
ABSTRACT
Most recent approaches to monocular 3D pose estimation rely on Deep Learning. They either train a Convolutional Neural Network to directly regress from an image to a 3D pose, which ignores the dependencies between human joints, or model these dependencies via a max-margin structured learning framework, which involves a high computational cost at inference time. In this paper, we introduce a Deep Learning regression architecture for structured prediction of 3D human pose from monocular images or 2D joint location heatmaps that relies on an overcomplete autoencoder to learn a high-dimensional latent pose representation and accounts for joint dependencies. We further propose an efficient Long Short-Term Memory network to enforce temporal consistency on 3D pose predictions. We demonstrate that our approach achieves state-of-the-art performance both in terms of structure preservation and prediction accuracy on standard 3D human pose estimation benchmarks.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (66)
CITATIONS (61)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....