About
Contact & Profiles
Research Areas
- Advanced Image and Video Retrieval Techniques
- Domain Adaptation and Few-Shot Learning
- Advanced Measurement and Detection Methods
- Video Surveillance and Tracking Methods
- Advanced Neural Network Applications
- Multimodal Machine Learning Applications
- Remote Sensing and LiDAR Applications
- Advanced Decision-Making Techniques
Guangzhou University
2024
10.1016/j.jvcir.2024.104064
article
EN
Journal of Visual Communication and Image Representation
2024-02-07
Visual grounding, as a crucial multimodal reasoning task, aims to locate target objects in images based on natural language queries. This task requires the model perform fusion and effectively. Early methods often rely complex manually designed modules for reasoning. However, these are usually customized certain specific scenarios, thus limiting generalization ability of model. Recent works achieve visual grounding through attention mechanism, which can capture alignment relationship between...
10.1145/3652583.3658002
article
EN
2024-05-30
10.1109/icip51287.2024.10647506
article
EN
2022 IEEE International Conference on Image Processing (ICIP)
2024-09-27
Coming Soon ...