Wenhao Xu

ORCID: 0009-0007-5826-3486
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Image and Video Retrieval Techniques
  • Domain Adaptation and Few-Shot Learning
  • Advanced Measurement and Detection Methods
  • Video Surveillance and Tracking Methods
  • Advanced Neural Network Applications
  • Multimodal Machine Learning Applications
  • Remote Sensing and LiDAR Applications
  • Advanced Decision-Making Techniques

Guangzhou University
2024

10.1016/j.jvcir.2024.104064 article EN Journal of Visual Communication and Image Representation 2024-02-07

Visual grounding, as a crucial multimodal reasoning task, aims to locate target objects in images based on natural language queries. This task requires the model perform fusion and effectively. Early methods often rely complex manually designed modules for reasoning. However, these are usually customized certain specific scenarios, thus limiting generalization ability of model. Recent works achieve visual grounding through attention mechanism, which can capture alignment relationship between...

10.1145/3652583.3658002 article EN 2024-05-30

10.1109/icip51287.2024.10647506 article EN 2022 IEEE International Conference on Image Processing (ICIP) 2024-09-27
Coming Soon ...