NFDI4DS | UHH-SEMS - Publication Details

Pan-Mamba: Effective pan-sharpening with state space model

OPENALEX - Publications

Xuanhua He Ke Cao Jie Zhang Keyu Yan Yingying Wang and 4 more

10.1016/j.inffus.2024.102779 article EN Information Fusion 2024-11-01

Multiscale Dual-Domain Guidance Network for Pan-Sharpening

OPENALEX - Publications

Xuanhua He Keyu Yan Jie Zhang Rui Li Chengjun Xie and 2 more

The goal of pan-sharpening is to produce a high-spatial-resolution multi-spectral (HRMS) image from low-spatial-resolution (LRMS) counterpart by super-resolving the LRMS one under guidance texture-rich panchromatic (PAN) image. Existing research has concentrated on using spatial information generate HRMS images, but neglected investigate frequency domain, which severely restricts performance improvement. In this work, we propose novel approach, named Multi-Scale Dual-Domain Guidance Network...

10.1109/tgrs.2023.3273334 article EN IEEE Transactions on Geoscience and Remote Sensing 2023-01-01

Cross-Modality Interaction Network for Pan-sharpening

OPENALEX - Publications

Yingying Wang Xuanhua He Yuhang Dong Yunlong Lin Yue Huang and 1 more

10.1109/tgrs.2024.3412683 article EN IEEE Transactions on Geoscience and Remote Sensing 2024-01-01

Pan-Sharpening With Wavelet-Enhanced High-Frequency Information

OPENALEX - Publications

Jie Zhang Xuanhua He Keyu Yan Ke Cao Rui Li and 3 more

Pan-sharpening is essentially a panchromatic (PAN)-guided super-resolution process, primarily focused on enhancing multi-spectral image quality. This methodology intricately incorporates the high-frequency derived from texture-rich PAN images into lower-resolution (LRMS) counterparts. However, current spatial domain techniques frequently face challenges in accurately restoring texture details, while frequency methods lack efficient interaction with domains, thus restricting overall model...

10.1109/tgrs.2024.3367165 article EN IEEE Transactions on Geoscience and Remote Sensing 2024-01-01

RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers

OPENALEX - Publications

Ke Cao Jing Wang Ao Ma Jiasong Feng Zhanjie Zhang and 6 more

The Diffusion Transformer plays a pivotal role in advancing text-to-image and text-to-video generation, owing primarily to its inherent scalability. However, existing controlled diffusion transformer methods incur significant parameter computational overheads suffer from inefficient resource allocation due their failure account for the varying relevance of control information across different layers. To address this, we propose Relevance-Guided Efficient Controllable Generation framework,...

10.48550/arxiv.2502.14377 preprint EN arXiv (Cornell University) 2025-02-20

High-order state space model for multi-modal accelerated MRI reconstruction

OPENALEX - Publications

Bei Tang Xuanhua He Si Wang Y. H. Zhan Gang Yang

10.1109/lsp.2025.3560594 article EN IEEE Signal Processing Letters 2025-01-01

Frequency-Adaptive Pan-Sharpening with Mixture of Experts

OPENALEX - Publications

Xuanhua He Keyu Yan Rui Li Chengjun Xie Jie Zhang and 1 more

Pan-sharpening involves reconstructing missing high-frequency information in multi-spectral images with low spatial resolution, using a higher-resolution panchromatic image as guidance. Although the inborn connection frequency domain, existing pan-sharpening research has not almost investigated potential solution upon domain. To this end, we propose novel Frequency Adaptive Mixture of Experts (FAME) learning framework for pan-sharpening, which consists three key components: Separation...

10.1609/aaai.v38i3.27984 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2024-03-24

Learning Diffusion High-Quality Priors for Pan-sharpening: A Two-Stage Approach with Time-Aware Adapter Fine-Tuning

OPENALEX - Publications

Yingying Wang Yunlong Lin Xuanhua He Hui Zheng Keyu Yan and 3 more

10.1109/tgrs.2025.3538744 article EN IEEE Transactions on Geoscience and Remote Sensing 2025-01-01

Towards Generalizable Pan-sharpening: Conditional Flow-based Learning Guided by Implicit High-frequency Priors

OPENALEX - Publications

Yingying Wang Hui Zheng Feifei Li Yunlong Lin Linyu Fan and 3 more

10.1109/tgrs.2025.3539013 article EN IEEE Transactions on Geoscience and Remote Sensing 2025-01-01

Pan-Mamba: Effective pan-sharpening with State Space Model

OPENALEX - Publications

Xuanhua He Ke Cao Keyu Yan Rui Li Chengjun Xie and 2 more

Pan-sharpening involves integrating information from lowresolution multi-spectral and high-resolution panchromatic images to generate counterparts. While recent advancements in the state space model, particularly efficient long-range dependency modeling achieved by Mamba, have revolutionized computer vision community, its untapped potential pan-sharpening motivates our exploration. Our contribution, Pan-Mamba, represents a novel pansharpening network that leverages efficiency of Mamba model...

10.48550/arxiv.2402.12192 preprint EN arXiv (Cornell University) 2024-02-19

ID-Animator: Zero-Shot Identity-Preserving Human Video Generation

OPENALEX - Publications

Xuanhua He Quande Liu Shengju Qian Xin Wang Tao Hu and 4 more

Generating high fidelity human video with specified identities has attracted significant attention in the content generation community. However, existing techniques struggle to strike a balance between training efficiency and identity preservation, either requiring tedious case-by-case finetuning or usually missing details process. In this study, we present ID-Animator, zero-shot human-video approach that can perform personalized given single reference facial image without further training....

10.48550/arxiv.2404.15275 preprint EN arXiv (Cornell University) 2024-04-23

Probing Synergistic High-Order Interaction for Multi-modal Image Fusion

OPENALEX - Publications

Man Zhou Naishan Zheng Xuanhua He Danfeng Hong Jocelyn Chanussot

Multi-modal image fusion aims to generate a fused by integrating and distinguishing the cross-modality complementary information from multiple source images. While cross-attention mechanism with global spatial interactions appears promising, it only captures second-order interactions, neglecting higher-order in both channel dimensions. This limitation hampers exploitation of synergies between multi-modalities. To bridge this gap, we introduce Synergistic High-order Interaction Paradigm...

10.1109/tpami.2024.3475485 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2024-01-01

Pyramid Dual Domain Injection Network for Pan-sharpening

OPENALEX - Publications

Xuanhua He Keyu Yan Rui Li Chengjun Xie Jie Zhang and 1 more

Pan-sharpening, a panchromatic image guided low-spatial-resolution multi-spectral super-resolution task, aims to reconstruct the missing high-frequency information of high-resolution counterpart. Although inborn connection with frequency domain, existing pan-sharpening research has almost investigated potential solution upon thus limiting model performance improvement. To this end, we first revisit degradation process in Fourier space, and then devise Pyramid Dual Domain Injection Network...

10.1109/iccv51070.2023.01186 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

Spatially-Adaptive Large-Kernel Network for Efficient Image Super-Resolution

OPENALEX - Publications

Xuanhua He Ke Cao Tao Hu Jie Zhang Rui Li

10.1109/lsp.2024.3445714 article EN IEEE Signal Processing Letters 2024-01-01

Enhancing RAW-to-sRGB with Decoupled Style Structure in Fourier Domain

OPENALEX - Publications

Xuanhua He Tao Hu Guoli Wang Zejin Wang Run Wang and 7 more

RAW to sRGB mapping, which aims convert images from smartphones into RGB form equivalent that of Digital Single-Lens Reflex (DSLR) cameras, has become an important area research. However, current methods often ignore the difference between cell phone and DSLR camera images, a goes beyond color matrix extends spatial structure due resolution variations. Recent directly rebuild mapping via shared deep representation, limiting optimal performance. Inspired by Image Signal Processing (ISP)...

10.1609/aaai.v38i3.27985 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2024-03-24

Enhancing RAW-to-sRGB with Decoupled Style Structure in Fourier Domain

OPENALEX - Publications

Xuanhua He Tao Hu Guoli Wang Zejin Wang Run Wang and 7 more

RAW to sRGB mapping, which aims convert images from smartphones into RGB form equivalent that of Digital Single-Lens Reflex (DSLR) cameras, has become an important area research. However, current methods often ignore the difference between cell phone and DSLR camera images, a goes beyond color matrix extends spatial structure due resolution variations. Recent directly rebuild mapping via shared deep representation, limiting optimal performance. Inspired by Image Signal Processing (ISP)...

10.48550/arxiv.2401.02161 preprint EN other-oa arXiv (Cornell University) 2024-01-01

Shuffle Mamba: State Space Models with Random Shuffle for Multi-Modal Image Fusion

OPENALEX - Publications

Ke Cao Xuanhua He Tao Hu Chengjun Xie Jie Zhang and 2 more

Multi-modal image fusion integrates complementary information from different modalities to produce enhanced and informative images. Although State-Space Models, such as Mamba, are proficient in long-range modeling with linear complexity, most Mamba-based approaches use fixed scanning strategies, which can introduce biased prior information. To mitigate this issue, we propose a novel Bayesian-inspired strategy called Random Shuffle, supplemented by an theoretically-feasible inverse shuffle...

10.48550/arxiv.2409.01728 preprint EN arXiv (Cornell University) 2024-09-03

Frequency Decomposition-Driven Network for JPEG Artifacts Removal

OPENALEX - Publications

Ke Cao Xuanhua He Keyu Yan Tao Hu Rui Li and 2 more

10.1109/icme57554.2024.10688323 article EN 2022 IEEE International Conference on Multimedia and Expo (ICME) 2024-07-15

Training-Free Large Model Priors for Multiple-in-One Image Restoration

OPENALEX - Publications

Xuanhua He Lang Li Yingying Wang Hui Zheng Ke Cao and 5 more

Image restoration aims to reconstruct the latent clear images from their degraded versions. Despite notable achievement, existing methods predominantly focus on handling specific degradation types and thus require specialized models, impeding real-world applications in dynamic scenarios. To address this issue, we propose Large Model Driven Restoration framework (LMDIR), a novel multiple-in-one image paradigm that leverages generic priors large multi-modal language models (MMLMs) pretrained...

10.48550/arxiv.2407.13181 preprint EN arXiv (Cornell University) 2024-07-18

Frequency-Adaptive Pan-Sharpening with Mixture of Experts

OPENALEX - Publications

Xuanhua He Keyu Yan Rui Li Chengjun Xie Jie Zhang and 1 more

Pan-sharpening involves reconstructing missing high-frequency information in multi-spectral images with low spatial resolution, using a higher-resolution panchromatic image as guidance. Although the inborn connection frequency domain, existing pan-sharpening research has not almost investigated potential solution upon domain. To this end, we propose novel Frequency Adaptive Mixture of Experts (FAME) learning framework for pan-sharpening, which consists three key components: Separation...

10.48550/arxiv.2401.02151 preprint EN other-oa arXiv (Cornell University) 2024-01-01

Frequency decoupled domain-irrelevant feature learning for Pan-sharpening

OPENALEX - Publications

Jie Zhang Ke Cao Keyu Yan Yunlong Lin Xuanhua He and 5 more

10.1109/tcsvt.2024.3480950 article EN IEEE Transactions on Circuits and Systems for Video Technology 2024-01-01