NFDI4DS | UHH-SEMS - Publication Details

Exploiting Spatial-Temporal Relationships for 3D Pose Estimation via Graph Convolutional Networks

OPENALEX - Publications

Yujun Cai Liuhao Ge Jun Liu Jianfei Cai Tat‐Jen Cham and 2 more

Despite great progress in 3D pose estimation from single-view images or videos, it remains a challenging task due to the substantial depth ambiguity and severe self-occlusions. Motivated by effectiveness of incorporating spatial dependencies temporal consistencies alleviate these issues, we propose novel graph-based method tackle problem human body hand short sequence 2D joint detections. Particularly, domain knowledge about (body) configurations is explicitly incorporated into graph...

10.1109/iccv.2019.00236 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

3D Hand Shape and Pose Estimation From a Single RGB Image

OPENALEX - Publications

Liuhao Ge Zhou Ren Yuncheng Li Zehao Xue Yingying Wang and 2 more

This work addresses a novel and challenging problem of estimating the full 3D hand shape pose from single RGB image. Most current methods in analysis monocular images only focus on locations keypoints, which cannot fully express hand. In contrast, we propose Graph Convolutional Neural Network (Graph CNN) based method to reconstruct mesh surface that contains richer information both pose. To train networks with supervision, create large-scale synthetic dataset containing ground truth meshes...

10.1109/cvpr.2019.01109 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

3D Convolutional Neural Networks for Efficient and Robust Hand Pose Estimation from Single Depth Images

OPENALEX - Publications

Liuhao Ge Hui Liang Junsong Yuan Daniël Thalmann

We propose a simple, yet effective approach for real-time hand pose estimation from single depth images using three-dimensional Convolutional Neural Networks (3D CNNs). Image based features extracted by 2D CNNs are not directly suitable 3D due to the lack of spatial information. Our proposed CNN taking volumetric representation image as input can capture structure and accurately regress full in pass. In order make robust variations sizes global orientations, we perform data augmentation on...

10.1109/cvpr.2017.602 article EN 2017-07-01

Hand PointNet: 3D Hand Pose Estimation Using Point Sets

OPENALEX - Publications

Liuhao Ge Yujun Cai Junwu Weng Junsong Yuan

Convolutional Neural Network (CNN) has shown promising results for 3D hand pose estimation in depth images. Different from existing CNN-based methods that take either 2D images or volumes as the input, our proposed Hand PointNet directly processes point cloud models visible surface of regression. Taking normalized regression network is able to capture complex structures and accurately regress a low dimensional representation pose. In order further improve accuracy fingertips, we design...

10.1109/cvpr.2018.00878 article EN 2018-06-01

Robust 3D Hand Pose Estimation in Single Depth Images: From Single-View CNN to Multi-View CNNs

OPENALEX - Publications

Liuhao Ge Hui Liang Junsong Yuan Daniël Thalmann

Articulated hand pose estimation plays an important role in human-computer interaction. Despite the recent progress, accuracy of existing methods is still not satisfactory, partially due to difficulty embedded high-dimensional and non-linear regression problem. Different from discriminative that regress for with a single depth image, we propose first project query image onto three orthogonal planes utilize these multi-view projections 2D heat-maps which estimate joint positions on each...

10.1109/cvpr.2016.391 article EN 2016-06-01

Depth-Based 3D Hand Pose Estimation: From Current Achievements to Future Goals

OPENALEX - Publications

Shanxin Yuan Guillermo Garcia-Hernando Björn Stenger Gyeongsik Moon Ju Yong Chang and 19 more

In this paper, we strive to answer two questions: What is the current state of 3D hand pose estimation from depth images? And, what are next challenges that need be tackled? Following successful Hands Million Challenge (HIM2017), investigate top 10 state-of-the-art methods on three tasks: single frame estimation, tracking, and during object interaction. We analyze performance different CNN structures with regard shape, joint visibility, view point articulation distributions. Our findings...

10.1109/cvpr.2018.00279 article EN 2018-06-01

Real-Time 3D Hand Pose Estimation with 3D Convolutional Neural Networks

OPENALEX - Publications

Liuhao Ge Hui Liang Junsong Yuan Daniël Thalmann

In this paper, we present a novel method for real-time 3D hand pose estimation from single depth images using Convolutional Neural Networks (CNNs). Image-based features extracted by 2D CNNs are not directly suitable due to the lack of spatial information. Our proposed CNN-based method, taking volumetric representation image as input and extracting input, can capture structure accurately regress full in pass. order make CNN robust variations sizes global orientations, perform data...

10.1109/tpami.2018.2827052 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2018-04-16

SO-HandNet: Self-Organizing Network for 3D Hand Pose Estimation With Semi-Supervised Learning

OPENALEX - Publications

Yujin Chen Zhigang Tu Liuhao Ge Dejun Zhang Ruizhi Chen and 1 more

3D hand pose estimation has made significant progress recently, where Convolutional Neural Networks (CNNs) play a critical role. However, most of the existing CNN-based methods depend much on training set, while labeling data is laborious and time-consuming. Inspired by point cloud autoencoder presented in self-organizing network (SO-Net), our proposed SO-HandNet aims at making use unannotated to obtain accurate semi-supervised manner. We exploit feature encoder (HFE) extract multi-level...

10.1109/iccv.2019.00706 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Robust 3D Hand Pose Estimation From Single Depth Images Using Multi-View CNNs

OPENALEX - Publications

Liuhao Ge Hui Liang Junsong Yuan Daniël Thalmann

Articulated hand pose estimation is one of core technologies in human-computer interaction. Despite the recent progress, most existing methods still cannot achieve satisfactory performance, partly due to difficulty embedded high-dimensional nonlinear regression problem. Most data-driven directly regress 3D from 2D depth image, which fully utilize information. In this paper, we propose a novel multi-view convolutional neural network (CNN)-based approach for estimation. To better exploit...

10.1109/tip.2018.2834824 article EN IEEE Transactions on Image Processing 2018-05-10

3D Hand Pose Estimation Using Synthetic Data and Weakly Labeled RGB Images

OPENALEX - Publications

Yujun Cai Liuhao Ge Jianfei Cai Nadia Magnenat Thalmann Junsong Yuan

Compared with depth-based 3D hand pose estimation, it is more challenging to infer from monocular RGB images, due the substantial depth ambiguity and difficulty of obtaining fully-annotated training data. Different existing learning-based RGB-input approaches that require accurate annotations for training, we propose leverage images can be easily obtained commodity RGB-D cameras during while testing take only inputs joint predictions. In this way, alleviate burden costly in real-world...

10.1109/tpami.2020.2993627 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2020-05-12

Robust 3D Hand Pose Estimation in Single Depth Images: from Single-View CNN to Multi-View CNNs

OPENALEX - Publications

Liuhao Ge Hui Liang Junsong Yuan Daniël Thalmann

Articulated hand pose estimation plays an important role in human-computer interaction. Despite the recent progress, accuracy of existing methods is still not satisfactory, partially due to difficulty embedded high-dimensional and non-linear regression problem. Different from discriminative that regress for with a single depth image, we propose first project query image onto three orthogonal planes utilize these multi-view projections 2D heat-maps which estimate joint positions on each...

10.48550/arxiv.1606.07253 preprint EN other-oa arXiv (Cornell University) 2016-01-01

Hough Forest With Optimized Leaves for Global Hand Pose Estimation With Arbitrary Postures

OPENALEX - Publications

Hui Liang Junsong Yuan Jun Lee Liuhao Ge Daniël Thalmann

Vision-based hand pose estimation is important in human-computer interaction. While many recent works focus on full degree-of-freedom estimation, robust of global remains a challenging problem. This paper presents novel algorithm to optimize the leaf weights Hough forest assist with single depth camera. Different from traditional forest, we propose learn vote stored at nodes principled way minimize average prediction error, so that ambiguous votes are largely suppressed during fusion....

10.1109/tcyb.2017.2779800 article EN IEEE Transactions on Cybernetics 2017-12-22

3D Hand Shape and Pose Estimation from a Single RGB Image

OPENALEX - Publications

Liuhao Ge Zhou Ren Yuncheng Li Zehao Xue Yingying Wang and 2 more

This work addresses a novel and challenging problem of estimating the full 3D hand shape pose from single RGB image. Most current methods in analysis monocular images only focus on locations keypoints, which cannot fully express hand. In contrast, we propose Graph Convolutional Neural Network (Graph CNN) based method to reconstruct mesh surface that contains richer information both pose. To train networks with supervision, create large-scale synthetic dataset containing ground truth meshes...

10.48550/arxiv.1903.00812 preprint EN other-oa arXiv (Cornell University) 2019-01-01

End-to-End 3D Hand Pose Estimation from Stereo Cameras

OPENALEX - Publications

Yuncheng Li Zehao Xue Yingying Wang Liuhao Ge Zhou Ren and 1 more

This work proposes an end-to-end approach to estimate full 3D hand pose from stereo cameras. Most existing methods of estimating cameras apply matching obtain depth map and use depth-based solution pose. In contrast, we propose bypass the directly image pairs. The proposed neural network architecture extends any keypoint predictor sparse disparity joints. order effectively train model, a large scale synthetic dataset that is composed pairs ground truth annotations. Experiments show...

10.48550/arxiv.2206.01384 preprint EN cc-by-nc-sa arXiv (Cornell University) 2022-01-01

Depth-Based 3D Hand Pose Estimation: From Current Achievements to Future Goals

OPENALEX - Publications

Shanxin Yuan Guillermo Garcia-Hernando Björn Stenger Gyeongsik Moon Ju Yong Chang and 19 more

In this paper, we strive to answer two questions: What is the current state of 3D hand pose estimation from depth images? And, what are next challenges that need be tackled? Following successful Hands Million Challenge (HIM2017), investigate top 10 state-of-the-art methods on three tasks: single frame estimation, tracking, and during object interaction. We analyze performance different CNN structures with regard shape, joint visibility, view point articulation distributions. Our findings...

10.48550/arxiv.1712.03917 preprint EN other-oa arXiv (Cornell University) 2017-01-01