Liuhao Ge

ORCID: 0000-0003-2022-6691
Research Areas
  • Human Pose and Action Recognition
  • Hand Gesture Recognition Systems
  • Robot Manipulation and Learning
  • Anomaly Detection Techniques and Applications
  • Advanced Vision and Imaging
  • Optical measurement and interference techniques
  • 3D Shape Modeling and Analysis
  • Video Surveillance and Tracking Methods
  • Gait Recognition and Analysis
  • Video Analysis and Summarization
  • Human Motion and Animation

Nanyang Technological University
2016-2021

Despite great progress in 3D pose estimation from single-view images or videos, it remains a challenging task due to the substantial depth ambiguity and severe self-occlusions. Motivated by the effectiveness of incorporating spatial dependencies and temporal consistencies to alleviate these issues, we propose a novel graph-based method to tackle the problem of 3D human body and 3D hand pose estimation from a short sequence of 2D joint detections. Particularly, domain knowledge about the human hand (body) configurations is explicitly incorporated into the graph...

10.1109/iccv.2019.00236 article EN 2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

This work addresses a novel and challenging problem of estimating the full 3D hand shape and pose from a single RGB image. Most current methods in 3D hand analysis from monocular RGB images only focus on estimating the 3D locations of hand keypoints, which cannot fully express the 3D shape of the hand. In contrast, we propose a Graph Convolutional Neural Network (Graph CNN) based method to reconstruct a full 3D mesh of the hand surface that contains richer information of both 3D hand shape and pose. To train networks with full supervision, we create a large-scale synthetic dataset containing both ground truth 3D meshes...

10.1109/cvpr.2019.01109 article EN 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

We propose a simple, yet effective approach for real-time hand pose estimation from single depth images using three-dimensional Convolutional Neural Networks (3D CNNs). Image-based features extracted by 2D CNNs are not directly suitable for 3D hand pose estimation due to the lack of 3D spatial information. Our proposed 3D CNN, taking a 3D volumetric representation of the hand depth image as input, can capture the 3D spatial structure of the input and accurately regress full 3D hand pose in a single pass. In order to make the 3D CNN robust to variations in hand sizes and global orientations, we perform 3D data augmentation on...

10.1109/cvpr.2017.602 article EN 2017-07-01

Convolutional Neural Network (CNN) has shown promising results for 3D hand pose estimation in depth images. Different from existing CNN-based hand pose estimation methods that take either 2D images or 3D volumes as the input, our proposed Hand PointNet directly processes the 3D point cloud that models the visible surface of the hand for pose regression. Taking the normalized point cloud as the input, our proposed hand pose regression network is able to capture complex hand structures and accurately regress a low dimensional representation of the 3D hand pose. In order to further improve the accuracy of fingertips, we design...

10.1109/cvpr.2018.00878 article EN 2018-06-01

Articulated hand pose estimation plays an important role in human-computer interaction. Despite the recent progress, the accuracy of existing methods is still not satisfactory, partially due to the difficulty of the embedded high-dimensional and non-linear regression problem. Different from the existing discriminative methods that regress for the hand pose with a single depth image, we propose to first project the query depth image onto three orthogonal planes and utilize these multi-view projections to regress for 2D heat-maps which estimate the joint positions on each...

10.1109/cvpr.2016.391 article EN 2016-06-01

In this paper, we strive to answer two questions: What is the current state of 3D hand pose estimation from depth images? And, what are the next challenges that need to be tackled? Following the successful Hands In the Million Challenge (HIM2017), we investigate the top 10 state-of-the-art methods on three tasks: single frame 3D pose estimation, 3D hand tracking, and hand pose estimation during object interaction. We analyze the performance of different CNN structures with regard to hand shape, joint visibility, view point and articulation distributions. Our findings...

10.1109/cvpr.2018.00279 article EN 2018-06-01

In this paper, we present a novel method for real-time 3D hand pose estimation from single depth images using 3D Convolutional Neural Networks (CNNs). Image-based features extracted by 2D CNNs are not directly suitable for 3D hand pose estimation due to the lack of 3D spatial information. Our proposed 3D CNN-based method, taking a 3D volumetric representation of the hand depth image as input and extracting 3D features from the volumetric input, can capture the 3D spatial structure of the input and accurately regress full 3D hand pose in a single pass. In order to make the 3D CNN robust to variations in hand sizes and global orientations, we perform 3D data...

10.1109/tpami.2018.2827052 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2018-04-16

3D hand pose estimation has made significant progress recently, where Convolutional Neural Networks (CNNs) play a critical role. However, most of the existing CNN-based hand pose estimation methods depend much on the training set, while labeling 3D hand pose on training data is laborious and time-consuming. Inspired by the point cloud autoencoder presented in self-organizing network (SO-Net), our proposed SO-HandNet aims at making use of the unannotated data to obtain accurate 3D hand pose estimation in a semi-supervised manner. We exploit a hand feature encoder (HFE) to extract multi-level...

10.1109/iccv.2019.00706 article EN 2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Articulated hand pose estimation is one of the core technologies in human-computer interaction. Despite the recent progress, most existing methods still cannot achieve satisfactory performance, partly due to the difficulty of the embedded high-dimensional nonlinear regression problem. Most existing data-driven methods directly regress 3D hand pose from a 2D depth image, which cannot fully utilize the depth information. In this paper, we propose a novel multi-view convolutional neural network (CNN)-based approach for 3D hand pose estimation. To better exploit...

10.1109/tip.2018.2834824 article EN IEEE Transactions on Image Processing 2018-05-10

Compared with depth-based 3D hand pose estimation, it is more challenging to infer 3D hand pose from monocular RGB images, due to the substantial depth ambiguity and the difficulty of obtaining fully-annotated training data. Different from the existing learning-based monocular RGB-input approaches that require accurate 3D annotations for training, we propose to leverage the depth images that can be easily obtained from commodity RGB-D cameras during training, while during testing we take only RGB inputs for 3D joint predictions. In this way, we alleviate the burden of the costly 3D annotations in real-world...

10.1109/tpami.2020.2993627 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2020-05-12

Articulated hand pose estimation plays an important role in human-computer interaction. Despite the recent progress, the accuracy of existing methods is still not satisfactory, partially due to the difficulty of the embedded high-dimensional and non-linear regression problem. Different from the existing discriminative methods that regress for the hand pose with a single depth image, we propose to first project the query depth image onto three orthogonal planes and utilize these multi-view projections to regress for 2D heat-maps which estimate the joint positions on each...

10.48550/arxiv.1606.07253 preprint EN other-oa arXiv (Cornell University) 2016-01-01

Vision-based hand pose estimation is important in human-computer interaction. While many recent works focus on full degree-of-freedom hand pose estimation, robust estimation of global hand pose remains a challenging problem. This paper presents a novel algorithm to optimize the leaf weights in a Hough forest to assist global hand pose estimation with a single depth camera. Different from the traditional Hough forest, we propose to learn the vote weights stored at the leaf nodes of a forest in a principled way to minimize average pose prediction error, so that ambiguous votes are largely suppressed during prediction fusion....

10.1109/tcyb.2017.2779800 article EN IEEE Transactions on Cybernetics 2017-12-22

This work addresses a novel and challenging problem of estimating the full 3D hand shape and pose from a single RGB image. Most current methods in 3D hand analysis from monocular RGB images only focus on estimating the 3D locations of hand keypoints, which cannot fully express the 3D shape of the hand. In contrast, we propose a Graph Convolutional Neural Network (Graph CNN) based method to reconstruct a full 3D mesh of the hand surface that contains richer information of both 3D hand shape and pose. To train networks with full supervision, we create a large-scale synthetic dataset containing both ground truth 3D meshes...

10.48550/arxiv.1903.00812 preprint EN other-oa arXiv (Cornell University) 2019-01-01

This work proposes an end-to-end approach to estimate full 3D hand pose from stereo cameras. Most existing methods of estimating hand pose from stereo cameras apply stereo matching to obtain a depth map and use a depth-based solution to estimate hand pose. In contrast, we propose to bypass the stereo matching and directly estimate the 3D hand pose from the stereo image pairs. The proposed neural network architecture extends from any keypoint predictor to estimate the sparse disparity of the hand joints. In order to effectively train the model, we propose a large scale synthetic dataset that is composed of stereo image pairs and ground truth 3D hand pose annotations. Experiments show...

10.48550/arxiv.2206.01384 preprint EN cc-by-nc-sa arXiv (Cornell University) 2022-01-01

In this paper, we strive to answer two questions: What is the current state of 3D hand pose estimation from depth images? And, what are the next challenges that need to be tackled? Following the successful Hands In the Million Challenge (HIM2017), we investigate the top 10 state-of-the-art methods on three tasks: single frame 3D pose estimation, 3D hand tracking, and hand pose estimation during object interaction. We analyze the performance of different CNN structures with regard to hand shape, joint visibility, view point and articulation distributions. Our findings...

10.48550/arxiv.1712.03917 preprint EN other-oa arXiv (Cornell University) 2017-01-01