NFDI4DS | UHH-SEMS - Publication Details

The Devil Is in the Details: Delving Into Unbiased Data Processing for Human Pose Estimation

OPENALEX - Publications

Junjie Huang Zheng Zhu Feng Guo Guan Huang

Recently, the leading performance of human pose estimation is dominated by top-down methods. Being a fundamental component in training and inference, data processing has not been systematically considered community, to best our knowledge. In this paper, we focus on problem find that devil estimator biased processing. Specifically, investigating standard state-of-the-art approaches mainly including transformation encoding-decoding, results obtained common flipping strategy are unaligned with...

10.1109/cvpr42600.2020.00574 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

Soil nitrogen cycling gene abundances in response to organic amendments: A meta-analysis

OPENALEX - Publications

Yajun Yang Hexiang Liu Yi Chen Lijuan Wu Guan Huang and 1 more

10.1016/j.scitotenv.2024.171048 article EN The Science of The Total Environment 2024-02-20

Sustainable Water Treatment Ceramsite Derived from Dredged Sludge and Biomass Waste for Neutralizing Acidic Mining Wastewater: Mechanisms and Efficiency

OPENALEX - Publications

Guan Huang Shouwei Jian Xin Gao Jianxiang Huang Xiufeng Deng and 2 more

10.1016/j.envres.2025.121360 article EN Environmental Research 2025-03-01

Action Machine: Toward Person-Centric Action Recognition in Videos

OPENALEX - Publications

Jiagang Zhu Wei Zou Zheng Zhu Liang Xu Guan Huang

Existing RGB and CNN-based methods in video action recognition mostly do not distinguish human body from the environment, thus easily overfit scenes objects of training sets. In this work, we present a conceptually simple, general high-performance framework for videos, aiming at person-centric modeling. The method, called Action Machine, is based on person bounding boxes instance-level analysis. It extends Inflated 3D ConvNet (I3D) by adding branch pose estimation 2D CNN pose-based...

10.1109/lsp.2019.2942739 article EN IEEE Signal Processing Letters 2019-09-20

DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving

OPENALEX - Publications

Xiaofeng Wang Zheng Zhu Guan Huang Xinze Chen Jiwen Lu

World models, especially in autonomous driving, are trending and drawing extensive attention due to their capacity for comprehending driving environments. The established world model holds immense potential the generation of high-quality videos, policies safe maneuvering. However, a critical limitation relevant research lies its predominant focus on gaming environments or simulated settings, thereby lacking representation real-world scenarios. Therefore, we introduce DriveDreamer, pioneering...

10.48550/arxiv.2309.09777 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Gait Recognition in the Wild: A Large-scale Benchmark and NAS-based Baseline

OPENALEX - Publications

Xianda Guo Zheng Zhu Tian Yang Beibei Lin Junjie Huang and 4 more

Gait benchmarks empower the research community to train and evaluate high-performance gait recognition systems. Even though growing efforts have been devoted cross-view recognition, academia is restricted by current existing databases captured in controlled environment. In this paper, we contribute a new benchmark strong baseline for REcognition Wild (GREW). The GREW dataset constructed from natural videos, which contain hundreds of cameras thousands hours streams open With tremendous manual...

10.1109/tpami.2025.3546482 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2025-01-01

FastPose: Towards Real-time Pose Estimation and Tracking via Scale-normalized Multi-task Networks

OPENALEX - Publications

Jiabin Zhang Zheng Zhu Wei Zou Peng Li Yanwei Li and 2 more

Both accuracy and efficiency are significant for pose estimation tracking in videos. State-of-the-art performance is dominated by two-stages top-down methods. Despite the leading results, these methods impractical real-world applications due to their separated architectures complicated calculation. This paper addresses task of articulated multi-person towards real-time speed. An end-to-end multi-task network (MTN) designed perform human detection, estimation, person re-identification (Re-ID)...

10.48550/arxiv.1908.05593 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Action Machine: Rethinking Action Recognition in Trimmed Videos

OPENALEX - Publications

Jiagang Zhu Wei Zou Liang Xu Yiming Hu Zheng Zhu and 4 more

Existing methods in video action recognition mostly do not distinguish human body from the environment and easily overfit scenes objects. In this work, we present a conceptually simple, general high-performance framework for trimmed videos, aiming at person-centric modeling. The method, called Action Machine, takes as inputs videos cropped by person bounding boxes. It extends Inflated 3D ConvNet (I3D) adding branch pose estimation 2D CNN pose-based recognition, being fast to train test....

10.48550/arxiv.1812.05770 preprint EN other-oa arXiv (Cornell University) 2018-01-01

The Devil is in the Details: Delving into Unbiased Data Processing for Human Pose Estimation

OPENALEX - Publications

Junjie Huang Zheng Zhu Feng Guo Guan Huang Dalong Du

Being a fundamental component in training and inference, data processing has not been systematically considered human pose estimation community, to the best of our knowledge. In this paper, we focus on problem find that devil evolution is biased processing. Specifically, by investigating standard state-of-the-art approaches mainly including coordinate system transformation keypoint format (i.e., encoding decoding), results obtained common flipping strategy are unaligned with original ones...

10.48550/arxiv.1911.07524 preprint EN other-oa arXiv (Cornell University) 2019-01-01

SIMPLE: SIngle-network with Mimicking and Point Learning for Bottom-up Human Pose Estimation

OPENALEX - Publications

Jiabin Zhang Zheng Zhu Jiwen Lu Junjie Huang Guan Huang and 1 more

The practical application requests both accuracy and efficiency on multi-person pose estimation algorithms. But the high fast inference speed are dominated by top-down methods bottom-up respectively. To make a better trade-off between efficiency, we propose novel framework, SIngle-network with Mimicking Point Learning for Bottom-up Human Pose Estimation (SIMPLE). Specifically, in training process, enable SIMPLE to mimic knowledge from high-performance pipeline, which significantly promotes...

10.1609/aaai.v35i4.16446 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2021-05-18

Development dilemma of ride-sharing: Revenue or social welfare?

OPENALEX - Publications

Wang Chen Guan Huang Jintao Ke

This study investigates the development dilemma of ride-sharing services using real-world mobility datasets from nine cities and calibrated customers' price detour elasticity. Through massive numerical experiments, this reveals that while can benefit social welfare, it may also lead to a loss revenue for transportation network companies (TNCs) or drivers compared with solo-hailing, which limits TNCs' motivation develop services. Three key factors contributing are identified: (1) low...

10.48550/arxiv.2412.08801 preprint EN arXiv (Cornell University) 2024-12-11

Exploiting Offset-guided Network for Pose Estimation and Tracking

OPENALEX - Publications

Rui Zhang Zheng Zhu Peng Li Rui Wu Chaoxu Guo and 2 more

Human pose estimation has witnessed a significant advance thanks to the development of deep learning. Recent human approaches tend directly predict location heatmaps, which causes quantization errors and inevitably deteriorates performance within reduced network output. Aim at solving it, we revisit heatmap-offset aggregation method propose Offset-guided Network (OGN) with an intuitive but effective fusion strategy for both two-stages Mask R-CNN. For estimation, greedy box generation is also...

10.48550/arxiv.1906.01344 preprint EN other-oa arXiv (Cornell University) 2019-01-01

AID: Pushing the Performance Boundary of Human Pose Estimation with Information Dropping Augmentation

OPENALEX - Publications

Junjie Huang Zheng Zhu Guan Huang Dalong Du

Both appearance cue and constraint are vital for human pose estimation. However, there is a tendency in most existing works to overfitting the former overlook latter. In this paper, we propose Augmentation by Information Dropping (AID) verify tackle dilemma. Alone with AID as prerequisite effectively exploiting its potential, customized training schedules, which designed analyzing pattern of loss performance process from perspective information supplying. experiments, model-agnostic...

10.48550/arxiv.2008.07139 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Multi-Stage HRNet: Multiple Stage High-Resolution Network for Human Pose Estimation

OPENALEX - Publications

Junjie Huang Zheng Zhu Guan Huang

Human pose estimation are of importance for visual understanding tasks such as action recognition and human-computer interaction. In this work, we present a Multiple Stage High-Resolution Network (Multi-Stage HRNet) to tackling the problem multi-person in images. Specifically, follow top-down pipelines high-resolution representations maintained during single-person estimation. addition, multiple stage network cross feature aggregation adopted further refine keypoint position. The resulting...

10.48550/arxiv.1910.05901 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Optimizing Features Quality: A Normalized Covariance Fusion Framework for Skeleton Action Recognition

OPENALEX - Publications

Guan Huang Qiuyan Yan

Action recognition based on 3D skeleton sequences has gained considerable attention in recent years. Due to effectively representing the spatial and temporal characters of sequences, Covariance Matrix (CM) features combined with Long Short-Term Memory (LSTM) network is an effective reasonable roadmap enhance action accuracy. However, CM existing models are computed from raw data without normalization or static normalization. Moreover, a feature calculated all coordinates one frame, treating...

10.1109/access.2020.3037238 article EN cc-by IEEE Access 2020-01-01

A Novel Substrate for Acidic Mining Wastewater Treatment: Biomass Pore-Forming and Alkaline Modification

OPENALEX - Publications

Guan Huang Shouwei Jian Baodong Li Xin Gao Jianxiang Huang and 2 more

The advancement of mine industry produces substantial volumes acidic mining wastewater (AMW) annually, posing significant environmental risks. Porous ceramsite, a typical water treatment material, could be potentially apply for treating AMW with alkaline source. This study delves into the efficiency and mechanism ceramsite (WTC) derived from dredged sludge, biomass waste, source as raw materials. Results show that WTC 8% CaCO3 effectively increased pH to 7 within nearly 60 min. mineral...

10.2139/ssrn.4714892 preprint EN 2024-01-01

Gait Recognition in the Wild: A Large-scale Benchmark and NAS-based Baseline

OPENALEX - Publications

Zheng Zhu Xianda Guo Tian Yang Junjie Huang Jiankang Deng and 4 more

Gait benchmarks empower the research community to train and evaluate high-performance gait recognition systems. Even though growing efforts have been devoted cross-view recognition, academia is restricted by current existing databases captured in controlled environment. In this paper, we contribute a new benchmark strong baseline for REcognition Wild (GREW). The GREW dataset constructed from natural videos, which contain hundreds of cameras thousands hours streams open With tremendous manual...

10.48550/arxiv.2205.02692 preprint EN cc-by arXiv (Cornell University) 2022-01-01

SIMPLE: SIngle-network with Mimicking and Point Learning for Bottom-up Human Pose Estimation

OPENALEX - Publications

Jiabin Zhang Zheng Zhu Jiwen Lu Junjie Huang Guan Huang and 1 more

The practical application requests both accuracy and efficiency on multi-person pose estimation algorithms. But the high fast inference speed are dominated by top-down methods bottom-up respectively. To make a better trade-off between efficiency, we propose novel framework, SIngle-network with Mimicking Point Learning for Bottom-up Human Pose Estimation (SIMPLE). Specifically, in training process, enable SIMPLE to mimic knowledge from high-performance pipeline, which significantly promotes...

10.48550/arxiv.2104.02486 preprint EN other-oa arXiv (Cornell University) 2021-01-01

A domain adaptive and continual learning method for skeleton behavior recognition in classroom environment

OPENALEX - Publications

Qiuyan Yan Yan Hu Guan Huang Zefang Chen

Student behavior in the classroom is an important part of analysis process and effectiveness. Due to different physiological psychological states learners environment, it very common that there are differences between same types actions, diversity actions will change over time. To solve online recognition problem kind with high a domain adaptive continual learning method for skeleton-based proposed, which achieves transfer ability models actions. Experiments show this has better...

10.1109/tale52509.2021.9678711 article EN 2021-12-05