NFDI4DS | UHH-SEMS - Publication Details

Spatial-aware stacked regression network for real-time 3D hand pose estimation

03 medical and health sciences 0302 clinical medicine

DOI: 10.1016/j.neucom.2021.01.045 Publication Date: 2021-01-19T02:00:16Z

Abstract Supplemental Material References Cited by

AUTHORS (8)

Pengfei Ren

Haifeng Sun

Weiting Huang

Jiachang Hao

Daixuan Cheng

Qi Qi

Jingyu Wang

Jianxin Liao

ABSTRACT

Abstract Making full use of the spatial information of the depth data is crucial for 3D hand pose estimation from a single depth image. In this paper, we propose a Spatial-aware Stacked Regression Network (SSRN) for fast, robust and accurate 3D hand pose estimation from a single depth image. By adopting a differentiable pose re-parameterization process, our method efficiently encodes the pose-dependent 3D spatial structure of the depth data as spatial-aware representations. Taking such spatial-aware representations as inputs, the stacked regression network utilizes multi-joint spatial context and the 3D spatial relationship between the estimated pose and the depth data to predict a refined hand pose. To further improve the estimation accuracy, we adopt a spatial attention mechanism to reduce the influence of irrelevant features for pose regression. In order to improve the speed of the network, we propose a cross-stage self-distillation mechanism to distill knowledge within the network itself. Experiments on four datasets show that our proposed method achieves state-of-the-art accuracy with high running speed around 330 FPS on a single GPU and 35 FPS on a single CPU.

SUPPLEMENTAL MATERIAL

Coming soon ....

REFERENCES (80)

CITATIONS (20)

EXTERNAL LINKS

OPENAIRE - Products CROSSREF - Publications

PlumX Metrics

Spatial-aware stacked regression network for real-time 3D hand pose estimation

RECOMMENDATIONS

FAIR ASSESSMENT

Coming soon ....

JUPYTER LAB

Coming soon ....