NFDI4DS | UHH-SEMS - Publication Details

Real-Time One-Stream Semantic-Guided Refinement Network for RGB-Thermal Salient Object Detection

OPENALEX - Publications

Fushuo Huo Xuegui Zhu Qian Zhang Ziming Liu Wenchao Yu

Salient Object Detection (SOD) has been widely used in practical applications such as multi-sensor image fusion, remote sensing, and defect detection. Recently, SOD from RGB Thermal (T) rapidly developed due to its robustness extreme situations like low illumination occlusion. However, existing methods all utilize a dual-stream encoder, which significantly increases the computation burdens hinders real-world deployment. To this end, we propose real-time One-stream Semantic-guided Refinement...

10.1109/tim.2022.3185323 article EN IEEE Transactions on Instrumentation and Measurement 2022-01-01

Efficient Context-Guided Stacked Refinement Network for RGB-T Salient Object Detection

OPENALEX - Publications

Fushuo Huo Xuegui Zhu Lei Zhang Qifeng Liu Yu Shu

RGB-T salient object detection (SOD) aims at utilizing the complementary cues of RGB and Thermal (T) modalities to detect segment common objects. However, on one hand, existing methods simply fuse features two without fully considering characters T. On other high computational cost prevents them from real-world applications (e.g., automatic driving, abnormal detection, person re-ID). To this end, we proposed an efficient encoder-decoder network named Context-guided Stacked Refinement Network...

10.1109/tcsvt.2021.3102268 article EN IEEE Transactions on Circuits and Systems for Video Technology 2021-08-03

Efficient Wavelet Boost Learning-Based Multi-stage Progressive Refinement Network for Underwater Image Enhancement

OPENALEX - Publications

Fushuo Huo Bingheng Li Xuegui Zhu

Raw underwater images suffer from low contrast and color cast due to wavelength-selective light scattering attenuation. The distortions in luminance mainly appear at the frequency while that edge texture are high frequency. However, hybrid difficult simultaneously recover for existing methods, which focus on spatial domain. To tackle these issues, we propose a novel deep learning network progressively refine by wavelet boost strategy (PRWNet), both domains. Specifically, Multi-stage...

10.1109/iccvw54120.2021.00221 article EN 2021-10-01

C2KD: Bridging the Modality Gap for Cross-Modal Knowledge Distillation

OPENALEX - Publications

Fushuo Huo Wenchao Xu Jingcai Guo Haozhao Wang Song Guo

10.1109/cvpr52733.2024.01515 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

Fast Fusion-Based Dehazing With Histogram Modification and Improved Atmospheric Illumination Prior

OPENALEX - Publications

Fushuo Huo Xuegui Zhu Hongjiang Zeng Qifeng Liu Jian Qiu

Haze can seriously affect the visible and visual quality of outdoor optical sensor systems, e.g., driving assistance, remote sensing, video surveillance. Single image dehazing is an intractable problem due to its ill-posed nature. The main idea paper combining multi-scale fusion strategy prior knowledge, thereby presenting balanced contrast enhancement intrinsic color preservation, efficiently. atmospheric illumination (AIP) has been proved that haze mainly degrades luminance channel rather...

10.1109/jsen.2020.3033713 article EN IEEE Sensors Journal 2020-10-26

Graph Knows Unknowns: Reformulate Zero-Shot Learning as Sample-Level Graph Recognition

OPENALEX - Publications

Jingcai Guo Song Guo Qihua Zhou Ziming Liu Xiaocheng Lu and 1 more

Zero-shot learning (ZSL) is an extreme case of transfer that aims to recognize samples (e.g., images) unseen classes relying on a train-set covering only seen and set auxiliary knowledge semantic descriptors). Existing methods usually resort constructing visual-to-semantics mapping based features extracted from each whole sample. However, since the visual spaces are inherently independent may exist in different manifolds, these easily suffer domain bias problem due classes. Unlike existing...

10.1609/aaai.v37i6.25942 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2023-06-26

UTDNet: A unified triplet decoder network for multimodal salient object detection

OPENALEX - Publications

Fushuo Huo Ziming Liu Jingcai Guo Wenchao Xu Song Guo

10.1016/j.neunet.2023.11.051 article EN Neural Networks 2023-11-24

Three-stream interaction decoder network for RGB-thermal salient object detection

OPENALEX - Publications

Fushuo Huo Xuegui Zhu Bingheng Li

10.1016/j.knosys.2022.110007 article EN Knowledge-Based Systems 2022-10-17

Non-exemplar Online Class-Incremental Continual Learning via Dual-Prototype Self-Augment and Refinement

OPENALEX - Publications

Fushuo Huo Wenchao Xu Jingcai Guo Haozhao Wang Yunfeng Fan

This paper investigates a new, practical, but challenging problem named Non-exemplar Online Class-incremental continual Learning (NO-CL), which aims to preserve the discernibility of base classes without buffering data examples and efficiently learn novel continuously in single-pass (i.e., online) stream. The challenges this task are mainly two-fold: (1) Both suffer from severe catastrophic forgetting as no previous samples available for replay. (2) As online can only be observed once, there...

10.1609/aaai.v38i11.29165 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2024-03-24

DRPT: Disentangled and Recurrent Prompt Tuning for Compositional Zero-Shot Learning

OPENALEX - Publications

Xiaocheng Lu Ziming Liu Song Guo Jingcai Guo Fushuo Huo and 2 more

Compositional Zero-shot Learning (CZSL) aims to recognize novel concepts composed of known knowledge without training samples. Standard CZSL either identifies visual primitives or enhances unseen entities, and as a result, entanglement between state object cannot be fully utilized. Admittedly, vision-language models (VLMs) could naturally cope with through tuning prompts, while uneven leads prompts dragged into local optimum. In this paper, we take further step introduce Disentangled...

10.48550/arxiv.2305.01239 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Towards Unbiased Multi-Label Zero-Shot Learning With Pyramid and Semantic Attention

OPENALEX - Publications

Ziming Liu Song Guo Jingcai Guo Yuanyuan Xu Fushuo Huo

Multi-label zero-shot learning extends conventional single-label to a more realistic scenario that aims at recognizing multiple unseen labels of classes for each input sample. Existing works usually exploit attention mechanism generate the correlation among different labels. However, most them are biased on several <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">major classes</i> while neglect xmlns:xlink="http://www.w3.org/1999/xlink">minor...

10.1109/tmm.2022.3222657 article EN IEEE Transactions on Multimedia 2022-11-16

REQA: Coarse-to-fine assessment of image quality to alleviate the range effect

OPENALEX - Publications

Bingheng Li Fushuo Huo

10.1016/j.jvcir.2023.104043 article EN Journal of Visual Communication and Image Representation 2024-01-03

Reqa: Coarse-to-Fine Assessment of Image Quality to Alleviate the Range Effect

OPENALEX - Publications

Bingheng Li Fushuo Huo

Blind image quality assessment suffers from the range effect, which indicates that on overall range, mean opinion score and predicted MOS are well correlated while focusing a particular narrow correlation is lower. To tackle this problem, novel method proposed coarse-grained metric to fine-grained prediction. Concretely, we utilize global context features local detailed for multi-scale distortion perception. Then, further boost assessment, introduce feedback mechanism, in accord with Human...

10.2139/ssrn.4594552 preprint EN 2023-01-01

(ML)2P-Encoder: On Exploration of Channel-Class Correlation for Multi-Label Zero-Shot Learning

OPENALEX - Publications

Ziming Liu Song Guo Xiaocheng Lu Jingcai Guo Jiewei Zhang and 2 more

Recent studies usually approach multi-label zeroshot learning (MLZSL) with visual-semantic mapping on spatial-class correlation, which can be computationally costly, and worse still, fails to capture fine-grained classspecific semantics. We observe that different channels may have sensitivities classes, correspond specific Such an intrinsic channelclass correlation suggests a potential alternative for the more accurate class-harmonious feature representations. In this paper, our interest is...

10.1109/cvpr52729.2023.02285 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Transient Electromagnetic Voltage Imaging of Dense UXO-Like Targets Based on Improved Mathematical Morphology

OPENALEX - Publications

Xuegui Zhu Yu Shu Chaopeng Luo Fushuo Huo Zhu Wang

Unexploded ordnance (UXO) survey is the foremost task of clearance project. Transient electromagnetic method (TEM) proved effective for UXO survey. However, it still difficult TEM to detect small-size, ultra-shallow and dense targets because large-size devices inversion methods large-scale applications are usually ineffective. In work, in order avoid complex apparent resistivity, voltages acquired by our specified small-loop system used depict a voltage distribution profile, which then...

10.1109/access.2020.3016895 article EN cc-by IEEE Access 2020-01-01

ProCC: Progressive Cross-Primitive Compatibility for Open-World Compositional Zero-Shot Learning

OPENALEX - Publications

Fushuo Huo Wenchao Xu Song Guo Jingcai Guo Haozhao Wang and 2 more

Open-World Compositional Zero-shot Learning (OW-CZSL) aims to recognize novel compositions of state and object primitives in images with no priors on the compositional space, which induces a tremendously large output space containing all possible state-object compositions. Existing works either learn joint embedding or predict simple separate classifiers. However, former method heavily relies external word methods, latter ignores interactions interdependent primitives, respectively. In this...

10.1609/aaai.v38i11.29164 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2024-03-24

Self-Introspective Decoding: Alleviating Hallucinations for Large Vision-Language Models

OPENALEX - Publications

Fushuo Huo Wenchao Xu Zhong Zhang Haozhao Wang Zhicheng Chen and 1 more

While Large Vision-Language Models (LVLMs) have rapidly advanced in recent years, the prevalent issue known as `hallucination' problem has emerged a significant bottleneck, hindering their real-world deployments. Existing methods mitigate this mainly from two perspectives: One approach leverages extra knowledge like robust instruction tuning LVLMs with curated datasets or employing auxiliary analysis networks, which inevitable incur additional costs. Another approach, contrastive decoding,...

10.48550/arxiv.2408.02032 preprint EN arXiv (Cornell University) 2024-08-04

Non-Exemplar Online Class-incremental Continual Learning via Dual-prototype Self-augment and Refinement

OPENALEX - Publications

Fushuo Huo Wenchao Xu Jingcai Guo Haozhao Wang Yunfeng Fan and 1 more

This paper investigates a new, practical, but challenging problem named Non-exemplar Online Class-incremental continual Learning (NO-CL), which aims to preserve the discernibility of base classes without buffering data examples and efficiently learn novel continuously in single-pass (i.e., online) stream. The challenges this task are mainly two-fold: (1) Both suffer from severe catastrophic forgetting as no previous samples available for replay. (2) As online can only be observed once, there...

10.48550/arxiv.2303.10891 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Towards Unbiased Multi-label Zero-Shot Learning with Pyramid and Semantic Attention

OPENALEX - Publications

Ziming Liu Song Guo Jingcai Guo Yuanyuan Xu Fushuo Huo

Multi-label zero-shot learning extends conventional single-label to a more realistic scenario that aims at recognizing multiple unseen labels of classes for each input sample. Existing works usually exploit attention mechanism generate the correlation among different labels. However, most them are biased on several major while neglect minor with same importance in samples, and may thus result overly diffused maps cannot sufficiently cover classes. We argue disregarding connection between...

10.48550/arxiv.2203.03483 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Spatiotemporal regularization correlation filter with response feedback

OPENALEX - Publications

Xuegui Zhu Yu Shu Qian Zhang Fushuo Huo

Most existing trackers based on correlation filtering try to introduce different regularization items improve the learning process of tracking object. However, parameters corresponding need be adjusted dynamically meet process. The are often fixed, so they cannot updated for more accurate tracking. We propose a method adaptively adjust spatiotemporal during make use important between response map and quality in confidence description index optimize parameters, training can focus part with...

10.1117/1.jei.31.4.043017 article EN Journal of Electronic Imaging 2022-07-01

REQA: Coarse-to-fine Assessment of Image Quality to Alleviate the Range Effect

OPENALEX - Publications

Bingheng Li Fushuo Huo

Blind image quality assessment (BIQA) of user generated content (UGC) suffers from the range effect which indicates that on overall range, mean opinion score (MOS) and predicted MOS (pMOS) are well correlated; focusing a particular correlation is lower. The reason for deviations both in wide narrow destroy uniformity between pMOS. To tackle this problem, novel method proposed coarse-grained metric to fine-grained prediction. Firstly, we design rank-and-gradient loss metric. keeps order grad...

10.48550/arxiv.2209.01760 preprint EN cc-by arXiv (Cornell University) 2022-01-01

ProCC: Progressive Cross-primitive Compatibility for Open-World Compositional Zero-Shot Learning

OPENALEX - Publications

Fushuo Huo Wenchao Xu Song Guo Jingcai Guo Haozhao Wang and 1 more

Open-World Compositional Zero-shot Learning (OW-CZSL) aims to recognize novel compositions of state and object primitives in images with no priors on the compositional space, which induces a tremendously large output space containing all possible state-object compositions. Existing works either learn joint embedding or predict simple separate classifiers. However, former heavily relies external word methods, latter ignores interactions interdependent primitives, respectively. In this paper,...

10.48550/arxiv.2211.12417 preprint EN cc-by arXiv (Cornell University) 2022-01-01