Guan Huang

ORCID: 0000-0001-5172-9203
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Human Pose and Action Recognition
  • Hand Gesture Recognition Systems
  • Video Surveillance and Tracking Methods
  • Anomaly Detection Techniques and Applications
  • Gait Recognition and Analysis
  • Advanced Vision and Imaging
  • Metal Extraction and Bioleaching
  • Extraction and Separation Processes
  • Indoor and Outdoor Localization Technologies
  • Urban Transport and Accessibility
  • Soil Carbon and Nitrogen Dynamics
  • Soil and Water Nutrient Dynamics
  • Transportation and Mobility Innovations
  • Sharing Economy and Platforms
  • Advanced Technologies in Various Fields
  • Generative Adversarial Networks and Image Synthesis
  • Advanced Sensor and Control Systems
  • Human Motion and Animation
  • Plant nutrient uptake and metabolism
  • Minerals Flotation and Separation Techniques
  • Autonomous Vehicle Technology and Safety
  • Mine drainage and remediation techniques
  • Domain Adaptation and Few-Shot Learning
  • Diabetic Foot Ulcer Assessment and Management

Wuhan University of Technology
2024-2025

Chinese Academy of Sciences
2019-2024

Northwest A&F University
2024

Institute of Soil and Water Conservation
2024

China University of Mining and Technology
2020-2021

Recently, the leading performance of human pose estimation is dominated by top-down methods. Being a fundamental component in training and inference, data processing has not been systematically considered community, to best our knowledge. In this paper, we focus on problem find that devil estimator biased processing. Specifically, investigating standard state-of-the-art approaches mainly including transformation encoding-decoding, results obtained common flipping strategy are unaligned with...

10.1109/cvpr42600.2020.00574 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

Existing RGB and CNN-based methods in video action recognition mostly do not distinguish human body from the environment, thus easily overfit scenes objects of training sets. In this work, we present a conceptually simple, general high-performance framework for videos, aiming at person-centric modeling. The method, called Action Machine, is based on person bounding boxes instance-level analysis. It extends Inflated 3D ConvNet (I3D) by adding branch pose estimation 2D CNN pose-based...

10.1109/lsp.2019.2942739 article EN IEEE Signal Processing Letters 2019-09-20

World models, especially in autonomous driving, are trending and drawing extensive attention due to their capacity for comprehending driving environments. The established world model holds immense potential the generation of high-quality videos, policies safe maneuvering. However, a critical limitation relevant research lies its predominant focus on gaming environments or simulated settings, thereby lacking representation real-world scenarios. Therefore, we introduce DriveDreamer, pioneering...

10.48550/arxiv.2309.09777 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Gait benchmarks empower the research community to train and evaluate high-performance gait recognition systems. Even though growing efforts have been devoted cross-view recognition, academia is restricted by current existing databases captured in controlled environment. In this paper, we contribute a new benchmark strong baseline for REcognition Wild (GREW). The GREW dataset constructed from natural videos, which contain hundreds of cameras thousands hours streams open With tremendous manual...

10.1109/tpami.2025.3546482 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2025-01-01

Both accuracy and efficiency are significant for pose estimation tracking in videos. State-of-the-art performance is dominated by two-stages top-down methods. Despite the leading results, these methods impractical real-world applications due to their separated architectures complicated calculation. This paper addresses task of articulated multi-person towards real-time speed. An end-to-end multi-task network (MTN) designed perform human detection, estimation, person re-identification (Re-ID)...

10.48550/arxiv.1908.05593 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Existing methods in video action recognition mostly do not distinguish human body from the environment and easily overfit scenes objects. In this work, we present a conceptually simple, general high-performance framework for trimmed videos, aiming at person-centric modeling. The method, called Action Machine, takes as inputs videos cropped by person bounding boxes. It extends Inflated 3D ConvNet (I3D) adding branch pose estimation 2D CNN pose-based recognition, being fast to train test....

10.48550/arxiv.1812.05770 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Being a fundamental component in training and inference, data processing has not been systematically considered human pose estimation community, to the best of our knowledge. In this paper, we focus on problem find that devil evolution is biased processing. Specifically, by investigating standard state-of-the-art approaches mainly including coordinate system transformation keypoint format (i.e., encoding decoding), results obtained common flipping strategy are unaligned with original ones...

10.48550/arxiv.1911.07524 preprint EN other-oa arXiv (Cornell University) 2019-01-01

The practical application requests both accuracy and efficiency on multi-person pose estimation algorithms. But the high fast inference speed are dominated by top-down methods bottom-up respectively. To make a better trade-off between efficiency, we propose novel framework, SIngle-network with Mimicking Point Learning for Bottom-up Human Pose Estimation (SIMPLE). Specifically, in training process, enable SIMPLE to mimic knowledge from high-performance pipeline, which significantly promotes...

10.1609/aaai.v35i4.16446 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2021-05-18

This study investigates the development dilemma of ride-sharing services using real-world mobility datasets from nine cities and calibrated customers' price detour elasticity. Through massive numerical experiments, this reveals that while can benefit social welfare, it may also lead to a loss revenue for transportation network companies (TNCs) or drivers compared with solo-hailing, which limits TNCs' motivation develop services. Three key factors contributing are identified: (1) low...

10.48550/arxiv.2412.08801 preprint EN arXiv (Cornell University) 2024-12-11

Human pose estimation has witnessed a significant advance thanks to the development of deep learning. Recent human approaches tend directly predict location heatmaps, which causes quantization errors and inevitably deteriorates performance within reduced network output. Aim at solving it, we revisit heatmap-offset aggregation method propose Offset-guided Network (OGN) with an intuitive but effective fusion strategy for both two-stages Mask R-CNN. For estimation, greedy box generation is also...

10.48550/arxiv.1906.01344 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Both appearance cue and constraint are vital for human pose estimation. However, there is a tendency in most existing works to overfitting the former overlook latter. In this paper, we propose Augmentation by Information Dropping (AID) verify tackle dilemma. Alone with AID as prerequisite effectively exploiting its potential, customized training schedules, which designed analyzing pattern of loss performance process from perspective information supplying. experiments, model-agnostic...

10.48550/arxiv.2008.07139 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Human pose estimation are of importance for visual understanding tasks such as action recognition and human-computer interaction. In this work, we present a Multiple Stage High-Resolution Network (Multi-Stage HRNet) to tackling the problem multi-person in images. Specifically, follow top-down pipelines high-resolution representations maintained during single-person estimation. addition, multiple stage network cross feature aggregation adopted further refine keypoint position. The resulting...

10.48550/arxiv.1910.05901 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Action recognition based on 3D skeleton sequences has gained considerable attention in recent years. Due to effectively representing the spatial and temporal characters of sequences, Covariance Matrix (CM) features combined with Long Short-Term Memory (LSTM) network is an effective reasonable roadmap enhance action accuracy. However, CM existing models are computed from raw data without normalization or static normalization. Moreover, a feature calculated all coordinates one frame, treating...

10.1109/access.2020.3037238 article EN cc-by IEEE Access 2020-01-01

The advancement of mine industry produces substantial volumes acidic mining wastewater (AMW) annually, posing significant environmental risks. Porous ceramsite, a typical water treatment material, could be potentially apply for treating AMW with alkaline source. This study delves into the efficiency and mechanism ceramsite (WTC) derived from dredged sludge, biomass waste, source as raw materials. Results show that WTC 8% CaCO3 effectively increased pH to 7 within nearly 60 min. mineral...

10.2139/ssrn.4714892 preprint EN 2024-01-01

Gait benchmarks empower the research community to train and evaluate high-performance gait recognition systems. Even though growing efforts have been devoted cross-view recognition, academia is restricted by current existing databases captured in controlled environment. In this paper, we contribute a new benchmark strong baseline for REcognition Wild (GREW). The GREW dataset constructed from natural videos, which contain hundreds of cameras thousands hours streams open With tremendous manual...

10.48550/arxiv.2205.02692 preprint EN cc-by arXiv (Cornell University) 2022-01-01

The practical application requests both accuracy and efficiency on multi-person pose estimation algorithms. But the high fast inference speed are dominated by top-down methods bottom-up respectively. To make a better trade-off between efficiency, we propose novel framework, SIngle-network with Mimicking Point Learning for Bottom-up Human Pose Estimation (SIMPLE). Specifically, in training process, enable SIMPLE to mimic knowledge from high-performance pipeline, which significantly promotes...

10.48550/arxiv.2104.02486 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Student behavior in the classroom is an important part of analysis process and effectiveness. Due to different physiological psychological states learners environment, it very common that there are differences between same types actions, diversity actions will change over time. To solve online recognition problem kind with high a domain adaptive continual learning method for skeleton-based proposed, which achieves transfer ability models actions. Experiments show this has better...

10.1109/tale52509.2021.9678711 article EN 2021-12-05
Coming Soon ...