- Advanced Neural Network Applications
- Video Surveillance and Tracking Methods
- Advanced Image and Video Retrieval Techniques
- Domain Adaptation and Few-Shot Learning
- Multimodal Machine Learning Applications
- Anomaly Detection Techniques and Applications
- Human Pose and Action Recognition
- Advanced Image Fusion Techniques
- Adversarial Robustness in Machine Learning
- Image Enhancement Techniques
- Machine Learning and ELM
- Face recognition and analysis
- Visual Attention and Saliency Detection
- Image and Video Quality Assessment
- Remote-Sensing Image Classification
- Color Science and Applications
- Handwritten Text Recognition Techniques
- Industrial Vision Systems and Defect Detection
- Fire Detection and Safety Systems
- Face and Expression Recognition
- Advanced Sensor and Control Systems
- Image Processing Techniques and Applications
- Medical Imaging Techniques and Applications
- Image and Signal Denoising Methods
- Advanced Vision and Imaging
Wuhan University of Technology
2024
Amazon (United States)
2021-2022
Seattle University
2022
Amazon (Germany)
2021
UC San Diego Health System
2017-2020
University of California, San Diego
2014-2020
Universidad Católica Santo Domingo
2019
Chinese Academy of Sciences
2012-2015
Institute of Automation
2012-2015
Shandong Institute of Automation
2013-2014
In object detection, an intersection over union (IoU) threshold is required to define positives and negatives. An detector, trained with low IoU threshold, e.g. 0.5, usually produces noisy detections. However, detection performance tends degrade increasing the thresholds. Two main factors are responsible for this: 1) overfitting during training, due exponentially vanishing positive samples, 2) inference-time mismatch between IoUs which detector optimal those of input hypotheses. A...
In object detection, the intersection over union (IoU) threshold is frequently used to define positives/negatives. The train a detector defines its quality. While commonly of 0.5 leads noisy (low-quality) detections, detection performance degrades for larger thresholds. This paradox high-quality has two causes: 1) overfitting, due vanishing positive samples large thresholds, and 2) inference-time quality mismatch between test hypotheses. A multi-stage architecture, Cascade R-CNN, composed...
The problem of quantizing the activations a deep neural network is considered. An examination popular binary quantization approach shows that this consists approximating classical non-linearity, hyperbolic tangent, by two functions: piecewise constant sign function, which used in feedforward computations, and linear hard tanh backpropagation step during learning. widely ReLU non-linearity then half-wave Gaussian quantizer (HWGQ) proposed for forward approximation shown to have efficient...
The design of complexity-aware cascaded detectors, combining features very different complexities, is considered. A new cascade procedure introduced, by formulating learning as the Lagrangian optimization a risk that accounts for both accuracy and complexity. boosting algorithm, denoted complexity aware training (CompACT), then derived to solve this optimization. CompACT cascades are shown seek an optimal trade-off between pushing higher later stages, where only few difficult candidate...
Despite increasing efforts on universal representations for visual recognition, few have addressed object detection. In this paper, we develop an effective and efficient detection system that is capable of working various image domains, from human faces traffic signs to medical CT images. Unlike multi-domain models, model does not require prior knowledge the domain interest. This achieved by introduction a new family adaptation layers, based principles squeeze excitation, domain-attention...
In recent years, numerous effective multi-object tracking (MOT) methods are developed because of the wide range applications. Existing performance evaluations MOT usually separate object step from detection by using same fixed results for comparisons. this work, we perform a comprehensive quantitative study on effects accuracy to overall performance, new large-scale University at Albany DETection and tRACking (UA-DETRAC) benchmark dataset. The UA-DETRAC dataset consists 100 challenging video...
In object detection, an intersection over union (IoU) threshold is required to define positives and negatives. An detector, trained with low IoU threshold, e.g. 0.5, usually produces noisy detections. However, detection performance tends degrade increasing the thresholds. Two main factors are responsible for this: 1) overfitting during training, due exponentially vanishing positive samples, 2) inference-time mismatch between IoUs which detector optimal those of input hypotheses. A...
Low-precision networks, with weights and activations quantized to low bit-width, are widely used accelerate inference on edge devices. However, current solutions uniform, using identical bit-width for all filters. This fails account the different sensitivities of filters is suboptimal. Mixed-precision networks address this problem, by tuning individual filter requirements. In work, problem optimal mixed-precision network search (MPS) considered. To circumvent its difficulties discrete space...
We present a plug-in replacement for batch normalization (BN) called exponential moving average (EMAN), which improves the performance of existing student-teacher based self- and semi-supervised learning techniques. Unlike standard BN, where statistics are computed within each batch, EMAN, used in teacher, updates its by from BN student. This design reduces intrinsic cross-sample dependency enhances generalization teacher. EMAN strong baselines self-supervised 4-6/1-2 points about 7/2...
While some efforts have been paid to handle deformation and occlusion in visual tracking, they are still great challenges. In this paper, a dynamic graph-based tracker (DGT) is proposed address these two challenges unified framework. the target graph, nodes local parts encoding appearance information, edges interactions between inner geometric structure information. This graph representation provides much more information for tracking presence of occlusion. The then formulated as undirected...
The problem of quantizing the activations a deep neural network is considered. An examination popular binary quantization approach shows that this consists approximating classical non-linearity, hyperbolic tangent, by two functions: piecewise constant sign function, which used in feedforward computations, and linear hard tanh backpropagation step during learning. ReLU widely recent learning literature, then half-wave Gaussian quantizer (HWGQ) proposed for forward approximation shown to have...
The design of complexity-aware cascaded detectors, combining features very different complexities, is considered. A new cascade procedure introduced, by formulating learning as the Lagrangian optimization a risk that accounts for both accuracy and complexity. boosting algorithm, denoted complexity aware training (CompACT), then derived to solve this optimization. CompACT cascades are shown seek an optimal trade-off between pushing higher later stages, where only few difficult candidate...
In object detection, the intersection over union (IoU) threshold is frequently used to define positives/negatives. The train a detector defines its \textit{quality}. While commonly of 0.5 leads noisy (low-quality) detections, detection performance degrades for larger thresholds. This paradox high-quality has two causes: 1) overfitting, due vanishing positive samples large thresholds, and 2) inference-time quality mismatch between test hypotheses. A multi-stage architecture, Cascade R-CNN,...
Visual tracking is an important but challenging problem in the computer vision field. In real world, appearances of target and its surroundings change continuously over space time, which provides effective information to track robustly. However, enough attention has not been paid spatio-temporal appearance previous works. this paper, a robust context model based tracker presented complete task unconstrained environments. The constructed with temporal spatial models. captures historical...
The problem of pedestrian detection is considered. design complexity-aware cascaded detectors, combining features very different complexities, investigated. A new cascade procedure introduced, by formulating learning as the Lagrangian optimization a risk that accounts for both accuracy and complexity. boosting algorithm, denoted complexity aware training (CompACT), then derived to solve this optimization. CompACT cascades are shown seek an optimal trade-off between pushing higher later...
Despite increasing efforts on universal representations for visual recognition, few have addressed object detection. In this paper, we develop an effective and efficient detection system that is capable of working various image domains, from human faces traffic signs to medical CT images. Unlike multi-domain models, model does not require prior knowledge the domain interest. This achieved by introduction a new family adaptation layers, based principles squeeze excitation, domain-attention...