Qiming Zhang

ORCID: 0009-0007-4065-0891
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Image and Video Retrieval Techniques
  • Remote-Sensing Image Classification
  • Image Retrieval and Classification Techniques
  • Advanced Graph Theory Research
  • Limits and Structures in Graph Theory
  • Visual Attention and Saliency Detection
  • Graph theory and applications

The University of Sydney
2024

Shanghai University
2024

In the field of Moving Infrared Small Target Detection (MIRSTD), current methods typically use sequential modeling with two individual modules for spatial and temporal processing. However, such a strategy lacks clear guidance on motion displacement difference between moving targets background noise, thereby limiting feature discriminability resulting in error-prone target localization. This paper addresses this issue from clip frame levels proposes novel architecture MOCID MIRSTD. For...

10.1609/aaai.v39i10.33087 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2025-04-11

Obtaining image-level class labels for Remote Sensing (RS) images is a relatively straightforward process, sparking significant interest in Weakly Supervised Semantic Segmentation (WSSS). However, RS present challenges beyond those encountered generic WSSS, including complex backgrounds, densely distributed small objects, and considerable scale variations. To address above issues, we introduce COnsistency-COnstrained Multi-Class Attention model, noted as <italic...

10.1109/tgrs.2024.3392737 article EN IEEE Transactions on Geoscience and Remote Sensing 2024-01-01

Due to spatial redundancy in remote sensing images, sparse tokens containing rich information are usually involved self-attention (SA) reduce the overall token numbers within calculation, avoiding high computational cost issue Vision Transformers. However, such methods obtain by hand-crafted or parallel-unfriendly designs, posing a challenge reach better balance between efficiency and performance. Different from them, this paper proposes use learnable meta formulate tokens, which effectively...

10.24963/ijcai.2024/103 article EN 2024-07-26

Due to spatial redundancy in remote sensing images, sparse tokens containing rich information are usually involved self-attention (SA) reduce the overall token numbers within calculation, avoiding high computational cost issue Vision Transformers. However, such methods obtain by hand-crafted or parallel-unfriendly designs, posing a challenge reach better balance between efficiency and performance. Different from them, this paper proposes use learnable meta formulate tokens, which effectively...

10.48550/arxiv.2405.09789 preprint EN arXiv (Cornell University) 2024-05-15
Coming Soon ...