Fushuo Huo

ORCID: 0000-0003-1030-7834
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Domain Adaptation and Few-Shot Learning
  • Visual Attention and Saliency Detection
  • Multimodal Machine Learning Applications
  • COVID-19 diagnosis using AI
  • Advanced Neural Network Applications
  • Advanced Image Processing Techniques
  • Image Enhancement Techniques
  • Advanced Image and Video Retrieval Techniques
  • Advanced Image Fusion Techniques
  • Image and Video Quality Assessment
  • Machine Learning and ELM
  • Natural Language Processing Techniques
  • Geophysical Methods and Applications
  • Topic Modeling
  • Dental Research and COVID-19
  • Epilepsy research and treatment
  • Respiratory viral infections research
  • EEG and Brain-Computer Interfaces
  • Geophysical and Geoelectrical Methods
  • Non-Destructive Testing Techniques
  • Gaze Tracking and Assistive Technology
  • Text and Document Classification Technologies
  • Semantic Web and Ontologies
  • Impact of Light on Environment and Health
  • Video Surveillance and Tracking Methods

Hong Kong Polytechnic University
2022-2024

Chongqing University
2020-2022

Salient Object Detection (SOD) has been widely used in practical applications such as multi-sensor image fusion, remote sensing, and defect detection. Recently, SOD from RGB Thermal (T) rapidly developed due to its robustness extreme situations like low illumination occlusion. However, existing methods all utilize a dual-stream encoder, which significantly increases the computation burdens hinders real-world deployment. To this end, we propose real-time One-stream Semantic-guided Refinement...

10.1109/tim.2022.3185323 article EN IEEE Transactions on Instrumentation and Measurement 2022-01-01

RGB-T salient object detection (SOD) aims at utilizing the complementary cues of RGB and Thermal (T) modalities to detect segment common objects. However, on one hand, existing methods simply fuse features two without fully considering characters T. On other high computational cost prevents them from real-world applications (e.g., automatic driving, abnormal detection, person re-ID). To this end, we proposed an efficient encoder-decoder network named Context-guided Stacked Refinement Network...

10.1109/tcsvt.2021.3102268 article EN IEEE Transactions on Circuits and Systems for Video Technology 2021-08-03

Raw underwater images suffer from low contrast and color cast due to wavelength-selective light scattering attenuation. The distortions in luminance mainly appear at the frequency while that edge texture are high frequency. However, hybrid difficult simultaneously recover for existing methods, which focus on spatial domain. To tackle these issues, we propose a novel deep learning network progressively refine by wavelet boost strategy (PRWNet), both domains. Specifically, Multi-stage...

10.1109/iccvw54120.2021.00221 article EN 2021-10-01

10.1109/cvpr52733.2024.01515 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

Haze can seriously affect the visible and visual quality of outdoor optical sensor systems, e.g., driving assistance, remote sensing, video surveillance. Single image dehazing is an intractable problem due to its ill-posed nature. The main idea paper combining multi-scale fusion strategy prior knowledge, thereby presenting balanced contrast enhancement intrinsic color preservation, efficiently. atmospheric illumination (AIP) has been proved that haze mainly degrades luminance channel rather...

10.1109/jsen.2020.3033713 article EN IEEE Sensors Journal 2020-10-26

Zero-shot learning (ZSL) is an extreme case of transfer that aims to recognize samples (e.g., images) unseen classes relying on a train-set covering only seen and set auxiliary knowledge semantic descriptors). Existing methods usually resort constructing visual-to-semantics mapping based features extracted from each whole sample. However, since the visual spaces are inherently independent may exist in different manifolds, these easily suffer domain bias problem due classes. Unlike existing...

10.1609/aaai.v37i6.25942 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2023-06-26

This paper investigates a new, practical, but challenging problem named Non-exemplar Online Class-incremental continual Learning (NO-CL), which aims to preserve the discernibility of base classes without buffering data examples and efficiently learn novel continuously in single-pass (i.e., online) stream. The challenges this task are mainly two-fold: (1) Both suffer from severe catastrophic forgetting as no previous samples available for replay. (2) As online can only be observed once, there...

10.1609/aaai.v38i11.29165 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2024-03-24

Compositional Zero-shot Learning (CZSL) aims to recognize novel concepts composed of known knowledge without training samples. Standard CZSL either identifies visual primitives or enhances unseen entities, and as a result, entanglement between state object cannot be fully utilized. Admittedly, vision-language models (VLMs) could naturally cope with through tuning prompts, while uneven leads prompts dragged into local optimum. In this paper, we take further step introduce Disentangled...

10.48550/arxiv.2305.01239 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Multi-label zero-shot learning extends conventional single-label to a more realistic scenario that aims at recognizing multiple unseen labels of classes for each input sample. Existing works usually exploit attention mechanism generate the correlation among different labels. However, most them are biased on several <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">major classes</i> while neglect xmlns:xlink="http://www.w3.org/1999/xlink">minor...

10.1109/tmm.2022.3222657 article EN IEEE Transactions on Multimedia 2022-11-16

10.1016/j.jvcir.2023.104043 article EN Journal of Visual Communication and Image Representation 2024-01-03

Blind image quality assessment suffers from the range effect, which indicates that on overall range, mean opinion score and predicted MOS are well correlated while focusing a particular narrow correlation is lower. To tackle this problem, novel method proposed coarse-grained metric to fine-grained prediction. Concretely, we utilize global context features local detailed for multi-scale distortion perception. Then, further boost assessment, introduce feedback mechanism, in accord with Human...

10.2139/ssrn.4594552 preprint EN 2023-01-01

Recent studies usually approach multi-label zeroshot learning (MLZSL) with visual-semantic mapping on spatial-class correlation, which can be computationally costly, and worse still, fails to capture fine-grained classspecific semantics. We observe that different channels may have sensitivities classes, correspond specific Such an intrinsic channelclass correlation suggests a potential alternative for the more accurate class-harmonious feature representations. In this paper, our interest is...

10.1109/cvpr52729.2023.02285 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Unexploded ordnance (UXO) survey is the foremost task of clearance project. Transient electromagnetic method (TEM) proved effective for UXO survey. However, it still difficult TEM to detect small-size, ultra-shallow and dense targets because large-size devices inversion methods large-scale applications are usually ineffective. In work, in order avoid complex apparent resistivity, voltages acquired by our specified small-loop system used depict a voltage distribution profile, which then...

10.1109/access.2020.3016895 article EN cc-by IEEE Access 2020-01-01

Open-World Compositional Zero-shot Learning (OW-CZSL) aims to recognize novel compositions of state and object primitives in images with no priors on the compositional space, which induces a tremendously large output space containing all possible state-object compositions. Existing works either learn joint embedding or predict simple separate classifiers. However, former method heavily relies external word methods, latter ignores interactions interdependent primitives, respectively. In this...

10.1609/aaai.v38i11.29164 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2024-03-24

While Large Vision-Language Models (LVLMs) have rapidly advanced in recent years, the prevalent issue known as `hallucination' problem has emerged a significant bottleneck, hindering their real-world deployments. Existing methods mitigate this mainly from two perspectives: One approach leverages extra knowledge like robust instruction tuning LVLMs with curated datasets or employing auxiliary analysis networks, which inevitable incur additional costs. Another approach, contrastive decoding,...

10.48550/arxiv.2408.02032 preprint EN arXiv (Cornell University) 2024-08-04

This paper investigates a new, practical, but challenging problem named Non-exemplar Online Class-incremental continual Learning (NO-CL), which aims to preserve the discernibility of base classes without buffering data examples and efficiently learn novel continuously in single-pass (i.e., online) stream. The challenges this task are mainly two-fold: (1) Both suffer from severe catastrophic forgetting as no previous samples available for replay. (2) As online can only be observed once, there...

10.48550/arxiv.2303.10891 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Multi-label zero-shot learning extends conventional single-label to a more realistic scenario that aims at recognizing multiple unseen labels of classes for each input sample. Existing works usually exploit attention mechanism generate the correlation among different labels. However, most them are biased on several major while neglect minor with same importance in samples, and may thus result overly diffused maps cannot sufficiently cover classes. We argue disregarding connection between...

10.48550/arxiv.2203.03483 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Most existing trackers based on correlation filtering try to introduce different regularization items improve the learning process of tracking object. However, parameters corresponding need be adjusted dynamically meet process. The are often fixed, so they cannot updated for more accurate tracking. We propose a method adaptively adjust spatiotemporal during make use important between response map and quality in confidence description index optimize parameters, training can focus part with...

10.1117/1.jei.31.4.043017 article EN Journal of Electronic Imaging 2022-07-01

Blind image quality assessment (BIQA) of user generated content (UGC) suffers from the range effect which indicates that on overall range, mean opinion score (MOS) and predicted MOS (pMOS) are well correlated; focusing a particular correlation is lower. The reason for deviations both in wide narrow destroy uniformity between pMOS. To tackle this problem, novel method proposed coarse-grained metric to fine-grained prediction. Firstly, we design rank-and-gradient loss metric. keeps order grad...

10.48550/arxiv.2209.01760 preprint EN cc-by arXiv (Cornell University) 2022-01-01

Open-World Compositional Zero-shot Learning (OW-CZSL) aims to recognize novel compositions of state and object primitives in images with no priors on the compositional space, which induces a tremendously large output space containing all possible state-object compositions. Existing works either learn joint embedding or predict simple separate classifiers. However, former heavily relies external word methods, latter ignores interactions interdependent primitives, respectively. In this paper,...

10.48550/arxiv.2211.12417 preprint EN cc-by arXiv (Cornell University) 2022-01-01
Coming Soon ...