Yongzhen Huang

ORCID: 0000-0003-4389-9805
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Human Pose and Action Recognition
  • Gait Recognition and Analysis
  • Video Surveillance and Tracking Methods
  • Advanced Image and Video Retrieval Techniques
  • Image Retrieval and Classification Techniques
  • Hand Gesture Recognition Systems
  • Diabetic Foot Ulcer Assessment and Management
  • Advanced Neural Network Applications
  • Anomaly Detection Techniques and Applications
  • Face recognition and analysis
  • Advanced Vision and Imaging
  • Visual Attention and Saliency Detection
  • Face and Expression Recognition
  • Remote-Sensing Image Classification
  • Robotics and Sensor-Based Localization
  • Domain Adaptation and Few-Shot Learning
  • Circular RNAs in diseases
  • Multimodal Machine Learning Applications
  • Indoor and Outdoor Localization Technologies
  • Emotion and Mood Recognition
  • Video Analysis and Summarization
  • Algorithms and Data Compression
  • Advanced Clustering Algorithms Research
  • Gaze Tracking and Assistive Technology
  • Topological and Geometric Data Analysis

Beijing Normal University
2021-2025

Center for Excellence in Brain Science and Intelligence Technology
2017-2022

Institute of Automation
2011-2022

University of Chinese Academy of Sciences
2017-2022

Chinese Academy of Sciences
2011-2020

Shandong Institute of Automation
2012-2020

First Affiliated Hospital of Henan University
2017

This paper studies an approach to gait based human identification via similarity learning by deep convolutional neural networks (CNNs). With a pretty small group of labeled multi-view walking videos, we can train recognize the most discriminative changes patterns which suggest change identity. To best our knowledge, this is first work on CNNs for recognition in literature. Here, provide extensive empirical evaluation terms various scenarios, namely, cross-view and cross-walking-condition,...

10.1109/tpami.2016.2545669 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2016-03-23

With the rapid growth of web images, hashing has received increasing interests in large scale image retrieval. Research efforts have been devoted to learning compact binary codes that preserve semantic similarity based on labels. However, most these methods are designed handle simple similarity. The complex multi-level structure images associated with multiple labels not yet well explored. Here we propose a deep ranking method for hash functions multilevel between multi-label images. In our...

10.1109/cvpr.2015.7298763 article EN 2015-06-01

While feedforward deep convolutional neural networks (CNNs) have been a great success in computer vision, it is important to note that the human visual cortex generally contains more feedback than connections. In this paper, we will briefly introduce background of feedbacks cortex, which motivates us develop computational mechanism networks. addition inference traditional networks, loop introduced infer activation status hidden layer neurons according "goal" network, e.g., high-level...

10.1109/iccv.2015.338 article EN 2015-12-01

One key challenging issue of facial expression recognition is to capture the dynamic variation physical structure from videos. In this paper, we propose a part-based hierarchical bidirectional recurrent neural network (PHRNN) analyze information temporal sequences. Our PHRNN models morphological variations and dynamical evolution expressions, which effective extract "temporal features" based on landmarks (geometry information) consecutive frames. Meanwhile, in order complement still...

10.1109/tip.2017.2689999 article EN IEEE Transactions on Image Processing 2017-03-30

Gait recognition, applied to identify individual walking patterns in a long-distance, is one of the most promising video-based biometric technologies. At present, gait recognition methods take whole human body as unit establish spatio-temporal representations. However, we have observed that different parts possess evidently various visual appearances and movement during walking. In latest literature, employing partial features for description has been verified being beneficial recognition....

10.1109/cvpr42600.2020.01423 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

Image classification is a hot topic in computer vision and pattern recognition. Feature coding, as key component of image classification, has been widely studied over the past several years, number coding algorithms have proposed. However, there no comprehensive study concerning connections between different methods, especially how they evolved. In this paper, we first make survey on various feature including their motivations mathematical representations, then exploit relations, based which...

10.1109/tpami.2013.113 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2013-06-14

The codebook based (bag-of-words) model is a widely applied for image classification. We analyze recent coding strategies in this model, and find that saliency the fundamental characteristic of coding. means if visual code much closer to descriptor than other codes, it will obtain very strong response. salient representation under maximum pooling operation leads state-of-the-art performance on many databases competitions. However, most current schemes do not recognize role representation, so...

10.1109/cvpr.2011.5995682 article EN 2011-06-01

With the rapid growth of web images, hashing has received increasing interests in large scale image retrieval. Research efforts have been devoted to learning compact binary codes that preserve semantic similarity based on labels. However, most these methods are designed handle simple similarity. The complex multilevel structure images associated with multiple labels not yet well explored. Here we propose a deep ranking method for hash functions between multi-label images. In our approach,...

10.48550/arxiv.1501.06272 preprint EN other-oa arXiv (Cornell University) 2015-01-01

Recently, deep learning-based cross-view gait recognition has become popular owing to the strong capacity of convolutional neural networks (CNNs). Current learning methods often rely on loss functions used widely in task face recognition, e.g., contrastive and triplet loss. These have problem hard negative mining. In this paper, a robust, effective, gait-related function, called angle center (ACL), is proposed learn discriminative features. The function robust different local parts temporal...

10.1109/tip.2019.2926208 article EN IEEE Transactions on Image Processing 2019-07-10

This paper proposes to learn features from sets of labeled raw images. With this method, the problem over-fitting can be effectively suppressed, so that deep CNNs trained scratch with a small number training data, i.e., 420 albums about 30 000 photos. method deal images, no matter if bear temporal structures. A typical approach sequential image analysis usually leverages motions between adjacent frames, while proposed focuses on capturing co-occurrences and frequencies features....

10.1109/tmm.2015.2477681 article EN IEEE Transactions on Multimedia 2015-09-11

The rapid advances of transportation infrastructure have led to a dramatic increase in the demand for smart systems capable monitoring traffic and street safety. Fundamental these applications are community-based evaluation platform benchmark object detection multi-object tracking. To this end, we organize AVSS2017 Challenge on Advanced Traffic Monitoring, conjunction with International Workshop Street Surveillance Safety Security (IWT4S), evaluate state-of-the-art tracking algorithms...

10.1109/avss.2017.8078560 article EN 2017-08-01

Gait recognition is beneficial for a variety of applications, including video surveillance, crime scene investigation, and social security, to mention few. However, gait often suffers from multiple exterior factors in real scenes, such as carrying conditions, wearing overcoats, diverse viewing angles. Recently, various deep learning-based methods have achieved promising results, but they tend extract one the salient features using fixed-weighted convolutional networks, do not well consider...

10.1109/cvpr52729.2023.02114 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Gait recognition plays a special role in visual surveillance due to its unique advantage, <i>e.g.</i>, long-distance, cross-view and non-cooperative recognition. However, it has not yet been widely applied. One reason for this awkwardness is the lack of truly big dataset captured practical outdoor scenarios. Here, &#x201C;big&#x201D; at least means: (1) huge amount gait videos, (2) sufficient subjects, (3) rich attributes, (4) spatial temporal variations. Moreover, most existing large-scale...

10.1109/tpami.2022.3183288 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2022-01-01

Gait depicts individuals' unique and distinguishing walking patterns has become one of the most promising biometric features for human identification. As a fine-grained recognition task, gait is easily affected by many factors usually requires large amount completely annotated data that costly insatiable. This paper proposes large-scale self-supervised benchmark with contrastive learning, aiming to learn general representation from massive unlabelled videos practical applications via...

10.1109/tpami.2023.3312419 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2023-09-06
Coming Soon ...