Yuxiang Zhao

ORCID: 0009-0004-3102-0730
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Human Pose and Action Recognition
  • Hand Gesture Recognition Systems
  • Video Surveillance and Tracking Methods
  • Metaheuristic Optimization Algorithms Research
  • Neural Networks and Applications
  • Tactile and Sensory Interactions
  • Multimodal Machine Learning Applications
  • Traffic Prediction and Management Techniques
  • Anomaly Detection Techniques and Applications
  • Video Analysis and Summarization
  • Gaze Tracking and Assistive Technology
  • Face and Expression Recognition
  • Image Enhancement Techniques
  • Advanced Image Fusion Techniques
  • Advanced Neural Network Applications
  • Advanced Multi-Objective Optimization Algorithms
  • Teleoperation and Haptic Systems
  • Advanced Data Compression Techniques
  • Mobile Crowdsensing and Crowdsourcing
  • Image Processing and 3D Reconstruction
  • Neural dynamics and brain function
  • Data Management and Algorithms
  • Context-Aware Activity Recognition Systems
  • Expert finding and Q&A systems
  • Modular Robots and Swarm Intelligence

China Mobile (China)
2023

Tianshui Normal University
2022

Baidu (China)
2021

Shenzhen Institutes of Advanced Technology
2021

Chinese Academy of Sciences
2021

Institute of Information Science, Academia Sinica
2011

Ta Hwa University of Science and Technology
2009

National Central University
2004-2007

Tamkang University
2001-2003

Long-range and short-range temporal modeling are two complementary crucial aspects of video recognition. Most the state-of-the-arts focus on spatio-temporal then average multiple snippet-level predictions to yield final video-level prediction. Thus, their prediction does not consider features how evolves along dimension. In this paper, we introduce a novel Dynamic Segment Aggregation (DSA) module capture relationship among snippets. To be more specific, attempt generate dynamic kernel for...

10.1145/3474085.3475344 article EN Proceedings of the 30th ACM International Conference on Multimedia 2021-10-17

10.1016/j.patcog.2009.03.020 article EN Pattern Recognition 2009-04-01

The detection of traffic anomalies is a critical component the intelligent city transportation management sys-tem. Previous works have proposed variety notable in-sights and taken step forward in this field, however, dealing with complex environment remains challenge. Moreover, lack high-quality data complexity scene, motivate us to study problem from hand-crafted perspective. In paper, we propose straightforward efficient framework that includes pre-processing, dynamic track module,...

10.1109/cvprw53098.2021.00450 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2021-06-01

A new approach to optimization problems based on the self-organizing feature maps is proposed. We name algorithm SOM-based (SOMO) algorithm. Through process, good solutions an problem can be simultaneously explored and exploited. An additional advantage of that outputs neural network allow us transform a multi-dimensional fitness landscape into three-dimensional projected landscape. Several simulations are used illustrate effectiveness proposed

10.1109/ijcnn.2004.1380019 article EN 2005-02-22

One particular application of gesture-based systems is to implement a speaking aid for the deaf. For this happen, it requires module that can recognize three-dimensional (3-D) arm movements since movement one four main attributes characterizing sign word. In paper authors propose fuzzy rule-based recognition method. The effectiveness system evaluated by recognizing 3-D involved in Taiwanese Sign Language (TSL).

10.1109/7333.928579 article EN IEEE Transactions on Neural Systems and Rehabilitation Engineering 2001-06-01

The detection of traffic anomalies is a critical component the intelligent city transportation management system. Previous works have proposed variety notable insights and taken step forward in this field, however, dealing with complex environment remains challenge. Moreover, lack high-quality data complexity scene, motivate us to study problem from hand-crafted perspective. In paper, we propose straightforward efficient framework that includes pre-processing, dynamic track module,...

10.48550/arxiv.2105.03827 preprint EN cc-by-sa arXiv (Cornell University) 2021-01-01

In recent years, deep learning, powered by neural networks, has achieved widespread success in solving high-dimensional problems, particularly those with low-dimensional feature structures. This stems from their ability to identify and learn low dimensional features tailored the problems. Understanding how networks extract such during training dynamics remains a fundamental question learning theory. this work, we propose novel perspective interpreting neurons last hidden layer of network as...

10.48550/arxiv.2412.05144 preprint EN arXiv (Cornell University) 2024-12-06

In this paper, we present JanusNet, an efficient CNN model that can perform online background subtraction and robustly detect moving targets using resource-constrained computational hardware on-board unmanned aerial vehicles (UAVs). Most of the existing work on sub-traction either assume camera is stationary or make limiting assumptions about motion camera, structure scene under observation, apparent in video. JanusNet does not have these limitations therefore, applicable to a variety UAV...

10.1109/iccvw54120.2021.00436 article EN 2021-10-01

The object of this paper is to present a set techniques integrated into two low-cost human computer interfaces. Although the interfaces have many potential applications, one main application help disabled persons attain or regain some degree independent communications and control. first interface voice-controlled mouse second an accelerometer-based mouse.

10.4015/s1016237204000475 article EN Biomedical Engineering Applications Basis and Communications 2004-12-25

Support vector machine(SVM) can effectively solve the classification problem with small samples, nonlinear and high dimensions, but it exits weak generalization ability low accuracy. So an improved genetic algorithm(IGA) is introduced in order to propose a new classification(IGASVM) method based on combining GA SVM model. In proposed IGASVM method, self-adaptive control parameter strategy improving convergence speed are into keep diversity of population, promptly reflect premature individual...

10.14257/ijsh.2016.10.5.12 article EN International Journal of Smart Home 2016-05-31

Color quantization is a process of sampling three-dimensional color space (e.g. RGB) to reduce the number colors in image. By reducing discrete subset known as codebook or palette, each pixel original image mapped an entry according these palette colors. In this paper, reinforcement-learning approach proposed. Fuzzy rules, which are used select appropriate parameters for adaptive clustering algorithm applied quantization, built through reinforcement learning. comparing new method with on 30...

10.6180/jase.2011.14.2.07 article EN International Conference on Intelligent Systems and Control 2011-06-01

There are many different approaches to recognition of spatio-temporal patterns. Each has its own merits and disadvantages. In this paper we present a neural-network-based approach pattern recognition. The effectiveness method is evaluated by recognizing 3D arm movements involved in Taiwanese sign language.

10.1109/ijcnn.1999.830850 article EN 2003-01-22

Data-driven black-box model-based optimization (MBO) problems arise in a great number of practical application scenarios, where the goal is to find design over whole space maximizing target function based on static offline dataset. In this work, we consider more general but challenging MBO setting, named constrained (CoMBO), only part can be optimized while rest by environment. A new challenge arising from CoMBO that most observed designs satisfy constraints are mediocre evaluation....

10.1145/3627676.3627685 preprint EN 2023-11-30

The purpose of this study is to design a portable virtual piano. By utilizing optical fiber gloves and the piano software designed by study, user can play anywhere at any time. This consists three major parts: finger tapping identification, hand movement positioning MIDI sound effect simulation. To piano, wears simulates key motions. bending information detected tell when motions are made. Images captured video camera analyzed, locations moving directions positioned, corresponding scales...

10.5281/zenodo.1077090 article EN cc-by Zenodo (CERN European Organization for Nuclear Research) 2010-07-26

Abstract In order to effectively remove the mixed noise, a novel algorithm for image reducing noise combined Pulse Coupled Neural Networks (PCNN) and regularization of Perona-Malik equation (P-M equation) is put forward. Firstly, positions impulse pollution are located by using PCNN, first stage adaptive filtering applied according result detection. Secondly, multi-direction information median local neighborhood noisy information. And then Gaussian in denoised utilizing P-M diffusion...

10.21203/rs.3.rs-1240281/v1 preprint EN cc-by Research Square (Research Square) 2022-07-05

Long-range and short-range temporal modeling are two complementary crucial aspects of video recognition. Most the state-of-the-arts focus on spatio-temporal then average multiple snippet-level predictions to yield final video-level prediction. Thus, their prediction does not consider features how evolves along dimension. In this paper, we introduce a novel Dynamic Segment Aggregation (DSA) module capture relationship among snippets. To be more specific, attempt generate dynamic kernel for...

10.48550/arxiv.2105.12085 preprint EN cc-by arXiv (Cornell University) 2021-01-01
Coming Soon ...