- Robotics and Sensor-Based Localization
- Advanced Neural Network Applications
- Medical Image Segmentation Techniques
- Domain Adaptation and Few-Shot Learning
- Multimodal Machine Learning Applications
- Advanced Image and Video Retrieval Techniques
- Face and Expression Recognition
- Adversarial Robustness in Machine Learning
- Remote-Sensing Image Classification
- Robot Manipulation and Learning
- Image Retrieval and Classification Techniques
- Image and Object Detection Techniques
- Image and Signal Denoising Methods
- Video Surveillance and Tracking Methods
- Model Reduction and Neural Networks
- 3D Surveying and Cultural Heritage
- Advanced Image Fusion Techniques
- Matrix Theory and Algorithms
- Advanced Steganography and Watermarking Techniques
- Human Pose and Action Recognition
- Sparse and Compressive Sensing Techniques
- 3D Shape Modeling and Analysis
- Natural Language Processing Techniques
- Chaos-based Image/Signal Encryption
- Machine Learning and ELM
Shanghai University
2016-2025
Huaqiao University
2023
Huazhong University of Science and Technology
1992-2021
East China Normal University
2007-2018
Nanyang Technological University
2018
Yanshan University
2015-2016
Hunan University
2004-2009
École Normale Supérieure de Lyon
2007-2008
École Normale Supérieure Paris-Saclay
2008
Shanghai Jiao Tong University
2007
In this paper, we address the semisupervised distance metric learning problem and its applications in classification image retrieval. First, formulate a model by considering information of inner classes interclasses. model, an adaptive parameter is designed to balance metrics intermetrics using data structure. Second, convert minimization whose variable symmetric positive-definite matrix. Third, implementation, deduce intrinsic steepest descent method, which assures that matrix strictly at...
Humans possess a unified cognitive ability to perceive, comprehend, and interact with the physical world. Why can't large language models replicate this holistic understanding? Through systematic analysis of existing training paradigms in vision-language-action (VLA), we identify two key challenges: spurious forgetting, where robot overwrites crucial visual-text alignments, task interference, competing control understanding tasks degrade performance when trained jointly. To overcome these...
Many deep learning models are vulnerable to the adversarial attack, i.e., imperceptible but intentionally-designed perturbations input can cause incorrect output of networks. In this paper, using information geometry, we provide a reasonable explanation for vulnerability models. By considering data space as non-linear with Fisher metric induced from neural network, first propose an attack algorithm termed one-step spectral (OSSA). The method is described by constrained quadratic form matrix,...
The leaderless consensus of fractional-order multi-agent systems (FOMASs) by intermittence sampled data control method is investigated in this brief, for which a distributed protocol presented to reduce the updating rate and working time controllers. Subsequently, Laplace transform stability theory are utilized derive some necessary sufficient criteria that show relations among fractional order, sampling period, communication width, coupling strengths, network topology. What more, it can be...
In the field of autonomous driving, carriers are equipped with a variety sensors, including cameras and LiDARs. However, camera suffers from problems illumination occlusion, LiDAR encounters motion distortion, degenerate environment limited ranging distance. Therefore, fusing information these two sensors deserves to be explored. this paper, we propose fusion network which robustly captures both image point cloud descriptors solve place recognition problem. Our contribution can summarized...
Quaternion singular value decomposition (QSVD) is a robust technique of digital watermarking that extracts high quality watermarks from watermarked images with low distortion. However, the existing QSVD-based schemes face obstacle "explosion complexity" and have much room for improvement in terms real-time, invisibility, robustness. In this paper, we overcome such by introducing new real structure-preserving QSVD algorithm propose novel scheme efficiency. Secret information transmitted...
This paper proposes a robust dual-color watermarking based on quaternion singular value decomposition (QSVD), which can embed large payloads into color images with low distortion, and obtain strong robustness to process image in holistic manner. First, two notes are proposed for designing the scheme, one of is about three correlations found <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$U$...
Continual Learning enables models to learn and adapt new tasks while retaining prior knowledge.Introducing tasks, however, can naturally lead feature entanglement across limiting the model's capability distinguish between domain data.In this work, we propose a method called Feature Realignment through Experts on hyperSpHere in (Fresh-CL). By leveraging predefined fixed simplex equiangular tight frame (ETF) classifiers hypersphere, our model improves separation both intra inter tasks.However,...
Object detection in unmanned aerial vehicle (UAV) remote sensing images poses significant challenges due to unstable image quality, small object sizes, complex backgrounds, and environmental occlusions. Small objects, particular, occupy minimal portions of images, making their accurate highly difficult. Existing multi-scale feature fusion methods address these some extent by aggregating features across different resolutions. However, often fail effectively balance classification localization...
Multimodal Large Language Models (MLLMs) have showcased impressive skills in tasks related to visual understanding and reasoning. Yet, their widespread application faces obstacles due the high computational demands during both training inference phases, restricting use a limited audience within research user communities. In this paper, we investigate design aspects of Small (MSLMs) propose an efficient multimodal assistant named Mipha, which is designed create synergy among various aspects:...
Highly efficient multifunctional materials that exhibit strong microwave absorption and elevated heat conduction are crucial for tackling electromagnetic interference accumulation in miniaturized integrated electronic systems. Nevertheless, simple...