- Advanced Neural Network Applications
- Neural Networks and Applications
- Video Surveillance and Tracking Methods
- Domain Adaptation and Few-Shot Learning
- Human Pose and Action Recognition
- Robotics and Sensor-Based Localization
- Anomaly Detection Techniques and Applications
- Advanced Optical Sensing Technologies
- Advanced Memory and Neural Computing
- Industrial Vision Systems and Defect Detection
- Remote Sensing and LiDAR Applications
- Machine Learning in Materials Science
- Advanced Image and Video Retrieval Techniques
- Model Reduction and Neural Networks
- Machine Learning and ELM
- Adversarial Robustness in Machine Learning
- COVID-19 diagnosis using AI
- Ferroelectric and Negative Capacitance Devices
- Multimodal Machine Learning Applications
- Generative Adversarial Networks and Image Synthesis
MIT University
2022-2025
RMIT University
2022-2025
Beijing Institute of Technology
2023
South China University of Technology
2019-2021
Australian Regenerative Medicine Institute
2021
Monash University
2021
Knowledge distillation becomes a de facto standard to improve the performance of small neural networks. Most previous works propose regress representational features from teacher student in one-to-one spatial matching fashion. However, people tend overlook fact that, due architecture differences, semantic information on same location usually vary. This greatly undermines underlying assumption approach. To this end, we novel one-to-all knowledge Specifically, allow each pixel feature be...
Knowledge Distillation has shown very promising ability in transferring learned representation from the larger model (teacher) to smaller one (student). Despite many efforts, prior methods ignore important role of retaining inter-channel correlation features, leading lack capturing intrinsic distribution feature space and sufficient diversity properties features teacher network. To solve issue, we propose novel Inter-Channel Correlation for (ICKD), with which homology student network can...
Recent advances in hand-crafted neural architectures for visual recognition underscore the pressing need to explore architecture designs comprising diverse building blocks. Concurrently, search (NAS) methods have gained traction as a means alleviate human efforts. Nevertheless, question of whether NAS can efficiently and effectively manage diversified spaces featuring disparate candidates, such Convolutional Neural Networks (CNNs) transformers, remains an open question. In this work, we...
The human visual system is capable of not only recognizing individual objects but also comprehending the contextual relationship between them in real-world scenarios, making it highly advantageous for object detection. However, practical applications, such information often available. Previous attempts to compensate this by utilizing cross-modal data as language and statistics obtain priors have been deemed sub-optimal due a semantic gap. To overcome challenge, we present seamless...
In this paper, we explore how to leverage readily available unlabeled data improve semi-supervised human detection performance. For purpose, specifically modify the region proposal network (RPN) for learning on a partially labeled dataset. Based commonly observed false positive types, verification module is developed assess foreground objects in candidate regions provide an important cue filtering RPN's proposals. The remaining proposals with high confidence scores are then used as pseudo...
We propose a GAN-based scene-specific instance synthesis and classification model for semi-supervised pedestrian detection. Instead of collecting unreliable detections from unlabeled data, we adopt class-conditional GAN synthesizing instances to alleviate the problem insufficient labeled data. With help base detector, integrate detection by including post-refinement classifier (PRC) into minimax game. A generator PRC can mutually reinforce each other high-fidelity providing more accurate...
Generating realistic pedestrian instances in a semi-supervised setting is promising but challenging due to the limited labeled data. We propose an unreliable-to-reliable instance translation model (Un2Reliab) conditioned on unreliable which poorly align with pedestrians. Un2Reliab mainly consists of encoder-decoder-like generative network and discriminative network, are jointly trained minimax game. adopt regularization ensure that synthesized semantically similar corresponding ground truth....
Multi-modality fusion and multi-task learning are becoming trendy in 3D autonomous driving scenario, considering robust prediction computation budget. However, naively extending the existing framework to domain of multi-modality remains ineffective even poisonous due notorious modality bias task conflict. Previous works manually coordinate with empirical knowledge, which may lead sub-optima. To mitigate issue, we propose a novel yet simple multi-level gradient calibration across tasks...
Neural Scene Flow Prior (NSFP) and Fast (FNSF) have shown remarkable adaptability in the context of large out-of-distribution autonomous driving. Despite their success, underlying reasons for astonishing generalization capabilities remain unclear. Our research addresses this gap by examining NSFP through lens uniform stability, revealing that its performance is inversely proportional to number input point clouds. This finding sheds light on NSFP's effectiveness handling large-scale cloud...
Self-attention mechanism is the key of Transformer but often criticized for its computation demands. Previous token pruning works motivate their methods from view redundancy still need to load full network and require same memory costs. This paper introduces a novel strategy that simplifies vision transformers reduces computational through selective removal non-essential attention layers, guided by entropy considerations. We identify regarding layer in bottom blocks, subsequent MLP i.e. two...
Knowledge Distillation has shown very promising abil-ity in transferring learned representation from the largermodel (teacher) to smaller one (student).Despitemany efforts, prior methods ignore important role ofretaining inter-channel correlation of features, leading tothe lack capturing intrinsic distribution featurespace and sufficient diversity properties features theteacher network.To solve issue, we propose thenovel Inter-Channel Correlation for Distillation(ICKD), with which homology...
Multi-modality fusion and multi-task learning are becoming trendy in 3D autonomous driving scenario, considering robust prediction computation budget. However, naively extending the existing framework to domain of multi-modality remains ineffective even poisonous due notorious modality bias task conflict. Previous works manually coordinate with empirical knowledge, which may lead sub-optima. To mitigate issue, we propose a novel yet simple multi-level gradient calibration across tasks...
Precise simultaneously localization and mapping is necessary to self-driving cars. In this paper, we present a SLAM system fusing with lidar IMU data. Considering that pose initial value key problem for point cloud ICP alignment, propose method using the Extended Kalman Filter combine yaw obtained by feature scan context preintegrated estimation value, aiming improve of vehicle. addition, adopt loop closure, which beneficial whole reduce accumulative errors. Sufficient experiments are...
Knowledge distillation becomes a de facto standard to improve the performance of small neural networks. Most previous works propose regress representational features from teacher student in one-to-one spatial matching fashion. However, people tend overlook fact that, due architecture differences, semantic information on same location usually vary. This greatly undermines underlying assumption approach. To this end, we novel one-to-all knowledge Specifically, allow each pixel feature be...