NFDI4DS | UHH-SEMS - Publication Details

Zhifu Zhao

ORCID: 0000-0002-4136-6362

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5063948295

Research Areas

Human Pose and Action Recognition
Gait Recognition and Analysis
Anomaly Detection Techniques and Applications
Sparse and Compressive Sensing Techniques
Hand Gesture Recognition Systems
Image and Signal Denoising Methods
Video Surveillance and Tracking Methods
Photoacoustic and Ultrasonic Imaging
EEG and Brain-Computer Interfaces
Multimodal Machine Learning Applications
Image Enhancement Techniques
Ultrasonics and Acoustic Wave Propagation
Gaze Tracking and Assistive Technology
Advanced Neural Network Applications
Infrastructure Maintenance and Monitoring
Diabetic Foot Ulcer Assessment and Management
Vehicle License Plate Recognition
Smart Parking Systems Research
Structural Health Monitoring Techniques
Visual Attention and Saliency Detection
Robotics and Sensor-Based Localization
Parkinson's Disease Mechanisms and Treatments
Blind Source Separation Techniques
Microwave Imaging and Scattering Analysis
Context-Aware Activity Recognition Systems

Xidian University
2017-2025

SGM-Net: Skeleton-guided multimodal network for action recognition

OPENALEX - Publications

Jianan Li Xuemei Xie Qingzhe Pan Yuhan Cao Zhifu Zhao and 1 more

10.1016/j.patcog.2020.107356 article EN Pattern Recognition 2020-04-09

Glimpse and Zoom: Spatio-Temporal Focused Dynamic Network for Skeleton-based Action Recognition

OPENALEX - Publications

Zhifu Zhao Ziwei Chen Jianan Li Xiaotian Wang Xuemei Xie and 3 more

GCN-based methods have achieved remarkable performance in skeleton-based action recognition. However, existing not explicitly attempted to remove temporal and spatial redundancy that might introduce additional computational costs. Inspired by the fact humans always tend glimpse at overall motion then zoom into most important spatio-temporal regions, we propose a Spatio Temporal Focused Dynamic Network (STFD-Net) trained with reinforcement learning for Specifically, first global extractor...

10.1109/tcsvt.2024.3358836 article EN IEEE Transactions on Circuits and Systems for Video Technology 2024-01-26

Adaptive Progressive Attention Graph Neural Network for EEG Emotion Recognition

OPENALEX - Publications

Tianzhi Feng Chennan Wu Yi Niu Fu Li Boxun Fu and 3 more

In recent years, numerous neuroscientific studies have shown that human emotions are closely linked to specific brain regions, with these regions exhibiting variability across individuals and emotional states. To fully leverage neural patterns, we propose an Adaptive Progressive Attention Graph Neural Network (APAGNN), which dynamically captures the spatial relationships among during processing. The APAGNN employs three specialized experts progressively analyze topology. first expert global...

10.48550/arxiv.2501.14246 preprint EN arXiv (Cornell University) 2025-01-24

Spatio-Temporal Progressive Attention Model for EEG Classification in Rapid Serial Visual Presentation Task

OPENALEX - Publications

Li Yang Wei Liu Tianzhi Feng Fu Li Chennan Wu and 4 more

As a type of multi-dimensional sequential data, the spatial and temporal dependencies electroencephalogram (EEG) signals should be further investigated. Thus, in this paper, we propose novel spatial-temporal progressive attention model (STPAM) to improve EEG classification rapid serial visual presentation (RSVP) tasks. STPAM first adopts three distinct experts learn topological information brain regions progressively, which is used minimize interference irrelevant regions. Concretely, former...

10.48550/arxiv.2502.00730 preprint EN arXiv (Cornell University) 2025-02-02

Dual Multi-Scale GCN with Deformable Temporal Kernel for Skeleton-based Action Recognition

OPENALEX - Publications

Jianan Li Yangtao Zhou Hua Chu Han Wang Zhifu Zhao and 2 more

10.1109/icassp49660.2025.10890725 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025-03-12

Exploring interaction: Inner-outer spatial–temporal transformer for skeleton-based mutual action recognition

OPENALEX - Publications

Xiaotian Wang Xiang Jiang Zhifu Zhao Kexin Wang Yifan Yang

10.1016/j.neucom.2025.130007 article EN Neurocomputing 2025-03-01

Real-Time Illegal Parking Detection System Based on Deep Learning

OPENALEX - Publications

Xuemei Xie Chenye Wang Shu Chen Guangming Shi Zhifu Zhao

The increasing illegal parking has become more and serious. Nowadays the methods of detecting illegally parked vehicles are based on background segmentation. However, this method is weakly robust sensitive to environment. Benefitting from deep learning, paper proposes a novel vehicle detection system. Illegal captured by camera firstly located classified famous Single Shot MultiBox Detector (SSD) algorithm. To improve performance, we propose optimize SSD adjusting aspect ratio default box...

10.1145/3094243.3094261 preprint EN 2017-06-02

STDM-transformer: Space-time dual multi-scale transformer network for skeleton-based action recognition

OPENALEX - Publications

Zhifu Zhao Ziwei Chen Jianan Li Xuemei Xie Kai Chen and 2 more

10.1016/j.neucom.2023.126903 article EN Neurocomputing 2023-10-06

Real-Time Vehicle Detection from UAV Imagery

OPENALEX - Publications

Xuemei Xie Wenzhe Yang Guimei Cao Jianxiu Yang Zhifu Zhao and 3 more

Fast and accurate vehicle detection in unmanned aerial (UAV) imagery is a meaningful but challenging task, playing an important role wide range of applications. Due to its tiny size, few features, variable scales imbalance sample problems UAV imagery, current deep learning methods used this task cannot achieve satisfactory performance both accuracy speed, which obvious classical trade-off problem. In paper, we propose single-shot detector, focuses on real-time imagery. We make contributions...

10.1109/bigmm.2018.8499466 article EN 2018-09-01

Knowledge embedded GCN for skeleton-based two-person interaction recognition

OPENALEX - Publications

Jianan Li Xuemei Xie Yuhan Cao Qingzhe Pan Zhifu Zhao and 1 more

10.1016/j.neucom.2019.12.149 article EN Neurocomputing 2020-11-23

Temporal Graph Modeling for Skeleton-based Action Recognition

OPENALEX - Publications

Jianan Li Xuemei Xie Zhifu Zhao Yuhan Cao Qingzhe Pan and 1 more

Graph Convolutional Networks (GCNs), which model skeleton data as graphs, have obtained remarkable performance for skeleton-based action recognition. Particularly, the temporal dynamic of sequence conveys significant information in recognition task. For modeling, GCN-based methods only stack multi-layer 1D local convolutions to extract relations between adjacent time steps. With repeat a lot convolutions, key with non-adjacent distance may be ignored due dilution. Therefore, these still...

10.48550/arxiv.2012.08804 preprint EN other-oa arXiv (Cornell University) 2020-01-01

ROI-CSNet: Compressive sensing network for ROI-aware image recovery

OPENALEX - Publications

Zhifu Zhao Xuemei Xie Chenye Wang Siying Mao Wan Liu and 1 more

10.1016/j.image.2019.06.006 article EN Signal Processing Image Communication 2019-06-14

Multi-Scale Adaptive Skeleton Transformer for action recognition

OPENALEX - Publications

Xiaotian Wang Kai Chen Zhifu Zhao Guangming Shi Xuemei Xie and 2 more

10.1016/j.cviu.2024.104229 article EN Computer Vision and Image Understanding 2024-11-19

A Hybrid-3D Convolutional Network for Video Compressive Sensing

OPENALEX - Publications

Zhifu Zhao Xuemei Xie Wan Liu Qingzhe Pan

Video Compressive Sensing (VCS) works to recover the scene video from limited compressed measurements. VCS was intended sense and in spatial-temporal sensing manner. It is difficult be performed due complexity of design optimization. The most current approaches measure only spatial or temporal domain. However, this would lose correlation VCS. Focus on issue, paper proposes a framework, which uses learned manner hybrid-3D recovery network. In terms technical study, we develop residual block...

10.1109/access.2020.2969290 article EN cc-by IEEE Access 2020-01-01

Visualizing and understanding of learned compressive sensing with residual network

OPENALEX - Publications

Zhifu Zhao Xuemei Xie Chenye Wang Wan Liu Guangming Shi and 1 more

10.1016/j.neucom.2019.05.043 article EN Neurocomputing 2019-05-25

View-normalized Skeleton Generation for Action Recognition

OPENALEX - Publications

Qingzhe Pan Zhifu Zhao Xuemei Xie Jianan Li Yuhan Cao and 1 more

Skeleton-based action recognition has attracted great interest due to low cost of skeleton data acquisition and high robustness external conditions. A challenging problem skeleton-based is the large intra-class gap caused by various viewpoints data, which makes modeling difficult for network. To alleviate this problem, a feasible solution utilize label supervised methods learn view-normalization model. However, since in real scenes acquired from diverse viewpoints, it obtain corresponding...

10.1145/3474085.3475341 article EN Proceedings of the 30th ACM International Conference on Multimedia 2021-10-17

Self-Constructing Temporal Excitation Graph for Skeleton-Based Action Recognition

OPENALEX - Publications

Jianan Li Zhifu Zhao Jiawen Yang Hua Chu Qingshan Li

Graph convolutional network (GCN)-based methods have obtained remarkable performance and gained widespread attention for skeleton-based human action recognition. These typically apply 1-D local convolutions to model temporal correlations simply utilize multilayer stacking capture long-range dynamics. However, the convolution focuses on relations between adjacent time steps. Also, with repeat of a lot convolutions, key relation nonadjacent distance may be ignored due information dilution....

10.1109/jsen.2023.3306819 article EN IEEE Sensors Journal 2023-08-24

Adaptive Spatio-Temporal Directed Graph Neural Network for Parkinson's Detection using Vertical Ground Reaction Force

OPENALEX - Publications

Xiaotian Wang Shuo Liang Zhifu Zhao Xinyu Cui Kai Chen and 1 more

Vertical Ground Reaction Force (VGRF) signal obtained from foot-worn sensors, also known as plantar data, provides a highly informative and detailed representation of an individual's gait features. Existing methods, such CNNs, LSTMs Transformers, have revealed the efficiency deep learning in Parkinson's Disease (PD) diagnosis using VGRF signal. However, intrinsic topologic graph pressure transmission characteristics data are overlooked those approaches, which essential features for analysis....

10.1145/3581783.3612935 article EN 2023-10-26

View-Normalized and Subject-Independent Skeleton Generation for Action Recognition

OPENALEX - Publications

Qingzhe Pan Zhifu Zhao Xuemei Xie Jianan Li Yuhan Cao and 1 more

Skeleton-based action recognition has attracted great interest in computer vision. For this task, a challenging problem concerns the large intraclass variances of skeleton data, which are mainly caused by diverse viewpoints and subjects, greatly increase difficulty modeling actions through network. To address above problem, we propose variance reduction (VaRe) framework for skeleton-based recognition, consists view-normalization generative adversarial network (VN-GAN), subject-independent...

10.1109/tcsvt.2022.3219864 article EN IEEE Transactions on Circuits and Systems for Video Technology 2022-11-04

Multi-Scale Adaptive Skeleton Transformer for Action Recognition

OPENALEX - Publications

Xiaotian Wang Kai Chen Zhifu Zhao Guangming Shi Xuemei Xie and 1 more

Download This Paper Open PDF in Browser Add to My Library Share: Permalink Using these links will ensure access this page indefinitely Copy URL DOI

10.2139/ssrn.4768672 preprint EN 2024-01-01

Hand-aware graph convolution network for skeleton-based sign language recognition

OPENALEX - Publications

Juan Song Huixuechun Wang Jianan Li Jian Zheng Zhifu Zhao and 1 more

Skeleton-based Sign Language Recognition (SLR) is a challenging research area mainly due to the fast and complex hand movement. Currently, Graph Convolution Networks (GCNs) have been employed in skeleton-based SLR achieved remarkable performance. However, existing GCN-based methods suffer from lack of explicit attention topology which plays an important role sign language representation. To address this issue, we propose novel Hand-Aware Network (HA-GCN) focus on topological relationships...

10.1016/j.jiixd.2024.08.001 article EN cc-by-nc-nd Journal of Information and Intelligence 2024-08-01

High-resolution Ultrasonic Echo Detection with Two-stage Recurrent Neural Network

OPENALEX - Publications

Qingzhe Pan Xuemei Xie Zhifu Zhao Siying Mao Jianan Li and 1 more

Ultrasonic echo methods have been widely researched for the application of flaw detection, where locations are identified by arrival time each echo. The main difficulty is that receiving echoes from consecutive flaws overlap in when close. Over last decades, sparse approximation and neural-network-based used to address this issue. However, these cannot achieve satisfactory performance high-noise severe overlapping scenarios. In paper, we propose a high-resolution ultrasonic detection method...

10.1145/3446999.3447638 article EN 2020-12-25

Adaptive Measurement Network for CS Image Reconstruction

OPENALEX - Publications

Xuemei Xie Yuxiang Wang Guangming Shi Chenye Wang Jiang Du and 1 more

Conventional compressive sensing (CS) reconstruction is very slow for its characteristic of solving an optimization problem. Convolu- tional neural network can realize fast processing while achieving compa- rable results. While CS image recovery with high quality not only de- pends on good algorithms, but also measurements. In this paper, we propose adaptive measurement in which obtained by learning. The new consists a fully-connected layer and ReconNet. has low-dimension output acts as...

10.48550/arxiv.1710.01244 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Perceptual spatial-temporal video compressive sensing network

OPENALEX - Publications

Wan Liu Xuemei Xie Zhifu Zhao Guangming Shi

Deep neural networks have been applied to video compressive sensing (VCS) task recently. The existing DNN-based VCS methods compress and reconstruct the scene only in space or time dimensions, which ignores spatial-temporal correlation of video. And they generally utilize pixel-wise loss as function, causes results be over-smoothed. In this paper, we propose a perceptual network. network, compresses recovers both can preserve Besides, refine by selecting specific feature-wise terms adding...

10.1117/12.2558039 article EN 2020-01-03

Coming Soon ...