Yuang Zhang

ORCID: 0000-0002-4702-3352
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Video Surveillance and Tracking Methods
  • Anomaly Detection Techniques and Applications
  • Advanced Neural Network Applications
  • EEG and Brain-Computer Interfaces
  • Human Pose and Action Recognition
  • Emotion and Mood Recognition
  • Autonomous Vehicle Technology and Safety
  • Blind Source Separation Techniques
  • Advanced Computing and Algorithms
  • Multimodal Machine Learning Applications
  • Video Analysis and Summarization
  • Software-Defined Networks and 5G
  • Internet Traffic Analysis and Secure E-voting
  • Advanced Optical Network Technologies
  • Generative Adversarial Networks and Image Synthesis
  • Image Enhancement Techniques
  • Advanced Vision and Imaging
  • Online and Blended Learning
  • Human Mobility and Location-Based Analysis
  • Face and Expression Recognition
  • Educational Technology and Pedagogy
  • Advanced Optical Imaging Technologies
  • Network Security and Intrusion Detection
  • ECG Monitoring and Analysis
  • Gaze Tracking and Assistive Technology

Shanghai Jiao Tong University
2021-2024

Shandong Normal University
2017-2024

Wuhan University
2023-2024

McMaster University
2023

Southeast University
2023

University of Illinois Urbana-Champaign
2021

John Wiley & Sons (United States)
2020

Hudson Institute
2020

Zhejiang University
1997

State Key Laboratory of Modern Optical Instruments
1997

In this paper, we propose MOTRv2, a simple yet effective pipeline to bootstrap end-to-end multi-object tracking with pretrained object detector. Existing methods, e.g. MOTR [43] and TrackFormer [20] are inferior their tracking-by-detection counterparts mainly due poor detection performance. We aim improve by elegantly incorporating an extra first adopt the anchor formulation of queries then use detector generate proposals as anchors, providing prior MOTR. The modification greatly eases...

10.1109/cvpr52729.2023.02112 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

In this paper, we propose a new query-based detection framework for crowd detection. Previous detectors suffer from two drawbacks: first, multiple predictions will be inferred single object, typically in crowded scenes; second, the performance saturates as depth of decoding stage increases. Benefiting nature one-to-one label assignment rule, progressive predicting method to address above issues. Specifically, first select accepted queries prone generate true positive predictions, then refine...

10.1109/cvpr52688.2022.00093 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Emotion recognition has become an important component of human–computer interaction systems. Research on emotion based electroencephalogram (EEG) signals are mostly conducted by the analysis all channels' EEG signals. Although some progresses achieved, there still several challenges such as high dimensions, correlation between different features and feature redundancy in realistic experimental process. These have hindered applications to portable systems (or devices). This paper explores how...

10.1002/int.22295 article EN International Journal of Intelligent Systems 2020-10-08

Referring video object segmentation (RVOS) aims at segmenting an in a following human instruction. Current state-of-the-art methods fall into offline pattern, which each clip independently interacts with text embedding for cross-modal understanding. They usually present that the pattern is necessary RVOS, yet model limited temporal association within clip. In this work, we break up previous belief and propose simple effective online using explicit query propagation, named OnlineRefer....

10.1109/iccv51070.2023.00259 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

Colorization is a traditional computer vision task and it plays an important role in many time-consuming tasks, such as old film restoration. Existing methods suffer from unsaturated color temporally inconsistency. In this paper, we propose novel pipeline to overcome the challenges. We regard colorization generative introduce Stable Video Diffusion (SVD) our base model. design palette-based guider assist model generating vivid consistent colors. The context introduced by palette not only...

10.48550/arxiv.2501.19331 preprint EN arXiv (Cornell University) 2025-01-31

Pedestrian detection in a crowd is challenging task due to high number of mutually-occluding human instances, which brings ambiguity and optimization difficulties the current IoU-based ground truth assignment procedure classical object methods. In this paper, we develop unique perspective pedestrian as variational inference problem. We formulate novel efficient algorithm for by modeling dense proposals latent variable while proposing customized Auto-Encoding Variational Bayes (AEVB)...

10.1109/cvpr46437.2021.01145 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

With the development of Vehicular Ad-hoc Networks (VANETs), several data security challenges are revealed, such as hijacking and interception. Although vehicles authorized, malicious behaviors still be carried out. Security lapses may lead to potential accidents, which emphasizes importance laying a solid foundation for VANETs. Thanks base layer provided by cryptography technologies, problems can solved in VANETs avoid accidents. However, trust management focuses on analysis identification...

10.1109/jiot.2024.3363755 article EN IEEE Internet of Things Journal 2024-02-14

Abstract Objective . Emotion recognition on the basis of electroencephalography (EEG) signals has received a significant amount attention in areas cognitive science and human–computer interaction (HCI). However, most existing studies either focus one-dimensional EEG data, ignoring relationship between channels, or only extract time–frequency features while not involving spatial features. Approach We develop spatial–temporal features-based emotion using graph convolution network (GCN) long...

10.1088/1361-6579/acd675 article EN Physiological Measurement 2023-05-17

Although the existing networks are more often deployed in multidomain environment, most of researches focus on single-domain and there no appropriate solutions for virtual network mapping problem. In fact, studies assume that underlying can operate without any interruption. However, physical cannot ensure normal provision services external reasons traditional have difficulties to meet user needs, especially high security requirements transmission. order solve above problems, this paper...

10.1155/2017/5258010 article EN cc-by Security and Communication Networks 2017-01-01

Eye blinks take an important role for electroencephalography (EEG) signals that on one hand, they can severely impact the EEG signals, and other may provide useful information brain–computer interface (BCI) scientific applications. In particular, it is challenging to detect in real time from a single channel of signals. this work, we propose short windowed random forest based method toward Real-Time Blink detection (RT-Blink), which balances processing granularity computation complexity....

10.1109/jsen.2022.3232176 article EN IEEE Sensors Journal 2023-01-04

Although end-to-end multi-object trackers like MOTR enjoy the merits of simplicity, they suffer from conflict between detection and association seriously, resulting in unsatisfactory convergence dynamics. While MOTRv2 partly addresses this problem, it demands an additional network for assistance. In work, we serve as first to reveal that arises unfair label assignment detect queries track during training, where these recognize targets associate them. Based on observation, propose MOTRv3,...

10.48550/arxiv.2305.14298 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Summary Network virtualization is a promising technology, which used to solve the existing shortcomings of Internet architecture and in virtual network (VN) embedding plays vital role resource allocation. However, current algorithms mainly deal with static VN embedding; allocations resources are constant excessive requests' lifetime. In reality, user's requirements usually change time; hence, can lead low utilization substrate reduction operators' revenues. To aforementioned problem, we...

10.1002/cpe.4516 article EN Concurrency and Computation Practice and Experience 2018-04-27

In this paper, we propose MOTRv2, a simple yet effective pipeline to bootstrap end-to-end multi-object tracking with pretrained object detector. Existing methods, MOTR and TrackFormer are inferior their tracking-by-detection counterparts mainly due poor detection performance. We aim improve by elegantly incorporating an extra first adopt the anchor formulation of queries then use detector generate proposals as anchors, providing prior MOTR. The modification greatly eases conflict between...

10.48550/arxiv.2211.09791 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Referring video object segmentation (RVOS) aims at segmenting an in a following human instruction. Current state-of-the-art methods fall into offline pattern, which each clip independently interacts with text embedding for cross-modal understanding. They usually present that the pattern is necessary RVOS, yet model limited temporal association within clip. In this work, we break up previous belief and propose simple effective online using explicit query propagation, named OnlineRefer....

10.48550/arxiv.2307.09356 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Haze severely affects the reliability of vision-based industrial systems, and most current dehazing methods are not applicable to images captured by ultra-high-definition (UHD) cameras. In this article, we propose a novel dimensional transformation mixer (DMixer) model for recovering haze-free from UHD haze images. DMixer, module encodes complete image in multiple stages different perspectives associates features views permuting tensor efficient long-range dependency modeling. way, global...

10.1109/tii.2023.3331114 article EN IEEE Transactions on Industrial Informatics 2023-11-20

Unsupervised Outlier Detection (UOD) is an important data mining task. With the advance of deep learning, (OD) has received broad interest. Most UOD models are trained exclusively on clean datasets to learn distribution normal data, which requires huge manual efforts real-world if possible. Instead relying datasets, some approaches directly train and detect unlabeled contaminated leading need for methods that robust such conditions. Ensemble emerged as a superior solution enhance model...

10.1145/3637528.3671943 preprint EN arXiv (Cornell University) 2024-05-21
Coming Soon ...