Hu Wang

ORCID: 0000-0003-0965-1590
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Multimodal Machine Learning Applications
  • Domain Adaptation and Few-Shot Learning
  • Advanced Neural Network Applications
  • Advanced Image Processing Techniques
  • Image Enhancement Techniques
  • Advanced Vision and Imaging
  • Advanced Image and Video Retrieval Techniques
  • Music and Audio Processing
  • Speech and Audio Processing
  • Video Surveillance and Tracking Methods
  • Digital Media Forensic Detection
  • Advanced Optical Sensing Technologies
  • Radiomics and Machine Learning in Medical Imaging
  • Image Processing Techniques and Applications
  • Optical measurement and interference techniques
  • Medical Image Segmentation Techniques
  • Robotics and Sensor-Based Localization
  • Inertial Sensor and Navigation
  • Financial Distress and Bankruptcy Prediction
  • Medical Imaging and Analysis
  • Optical Systems and Laser Technology
  • Chaos-based Image/Signal Encryption
  • IoT-based Smart Home Systems
  • Seismology and Earthquake Studies
  • Face recognition and analysis

Mohamed bin Zayed University of Artificial Intelligence
2024

University of Electronic Science and Technology of China
2022-2023

The University of Adelaide
2023

Xinjiang University
2023

Shanghai University of Engineering Science
2022

China Electronics Technology Group Corporation
2018

Tianjin Huanhu Hospital
2018

Xi'an Institute of Optics and Precision Mechanics
2016

The missing modality issue is critical but non-trivial to be solved by multi-modal models. Current methods aiming handle the problem in tasks, either deal with modalities only during evaluation or train separate models specific settings. In addition, these are designed for so example, classification not easily adapted segmentation tasks and vice versa. this paper, we propose Shared-Specific Feature Modelling (ShaSpec) method that considerably simpler more effective than competing approaches...

10.1109/cvpr52729.2023.01524 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

10.1016/j.engappai.2023.105947 article EN Engineering Applications of Artificial Intelligence 2023-02-09

Recently, with the rise of high dynamic range (HDR) display devices, there is a great demand to transfer traditional low (LDR) images into HDR versions. The key success how solve many-to-many mapping problem. However, existing approaches either do not consider constraining solution space or just simply imitate inverse camera imaging pipeline in stages, without directly formulating image generation process. In this work, we address problem by integrating LDR-to-HDR knowledge an UNet...

10.24963/ijcai.2022/196 article EN Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence 2022-07-01

The costly and time-consuming annotation process to produce large training sets for modelling semantic LiDAR segmentation methods has motivated the development of semi-supervised learning (SSL) methods. However, such SSL approaches often concentrate on employing consistency only individual representations. This narrow focus results in limited perturbations that generally fail enable effective learning. Additionally, these employ contrastive based sampling from a set positive negative...

10.48550/arxiv.2407.07171 preprint EN arXiv (Cornell University) 2024-07-09

Audio-visual segmentation (AVS) is an emerging task that aims to accurately segment sounding objects based on audio-visual cues. The success of AVS learning systems depends the effectiveness cross-modal interaction. Such a requirement can be naturally fulfilled by leveraging transformer-based architecture due its inherent ability capture long-range dependencies and flexibility in handling different modalities. However, training issues methods, such as low efficacy cross-attention unstable...

10.48550/arxiv.2407.05358 preprint EN arXiv (Cornell University) 2024-07-07

Automatic lecture recording is an appealing alternative approach to manually lectures in the process of online course making as it can a large extent save labor cost. The key automatic system lecturer tracking, and existing tracking methods tend lose target case lecturer's rapid movement. This article proposes based on MobileNet-SSD face detection Pedestrian Dead Reckoning (PDR) technology solve this problem. First, particle filter algorithm used fuse PDR information with rotation angle...

10.7717/peerj-cs.971 article EN cc-by PeerJ Computer Science 2022-05-17

Previously a novel chaotic transmission method was proposed in air traffic voice communication system, which applied remote radio control transmission. It has better performance than Frequency Shift Keying efficiency, bandwidth and anti-additive-white-noise performance. However, the channel noise great effect on method. Thus problem of modulated signals affected by how to design M-ary system additive color Gauss for data are studied this paper. Firstly, signals' boundary range white is...

10.1109/wcica.2018.8630340 article EN 2018-07-01

To binocular camera, the consistency of optical parameters left and right system is an important factor that will influence overall imaging consistency. In conventional testing procedure system, there lacks specifications suitable for evaluating this paper, considering special requirements a method used to measure camera presented. Based on method, measurement which composed integrating sphere, rotary table CMOS has been established. First, let capture images in normal exposure time under...

10.1117/12.2242265 article EN Proceedings of SPIE, the International Society for Optical Engineering/Proceedings of SPIE 2016-09-27

Audio-visual segmentation (AVS) is a challenging task that involves accurately segmenting sounding objects based on audio-visual cues. The effectiveness of learning critically depends achieving accurate cross-modal alignment between sound and visual objects. Successful requires two essential components: 1) dataset with high-quality pixel-level multi-class annotated images associated audio files, 2) model can establish strong links information its corresponding object. However, these...

10.48550/arxiv.2304.02970 preprint EN cc-by-nc-sa arXiv (Cornell University) 2023-01-01

Abstract Current deep image super-resolution (SR) approaches aim to restore high-resolution images from down-sampled or by assuming degradation simple Gaussian kernels and additive noises. However, these techniques only assume crude approximations of the real-world process, which should involve complex noise patterns that are difficult model using assumptions. In this paper, we propose a more realistic process synthesise low-resolution for SR introducing new Kernel Adversarial Learning...

10.21203/rs.3.rs-3183618/v1 preprint EN cc-by Research Square (Research Square) 2023-07-24

The missing modality issue is critical but non-trivial to be solved by multi-modal models. Current methods aiming handle the problem in tasks, either deal with modalities only during evaluation or train separate models specific settings. In addition, these are designed for so example, classification not easily adapted segmentation tasks and vice versa. this paper, we propose Shared-Specific Feature Modelling (ShaSpec) method that considerably simpler more effective than competing approaches...

10.48550/arxiv.2307.14126 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Considering the conditional constraints in urban environment, traditional method of detecting drones is not applicable, and combining target detection with it an effective solution. We propose a lightweight object model based on YOLOv7. The replaces Backbone network MobileViTv3, which reduces number parameters model; Secondly, SPD-Conv module introduced to increase accuracy small-sized drones; In addition, we integrate SimAM Module (Simple, Parameter-Free Attention Module) find important...

10.1109/icsp58490.2023.10248550 article EN 2022 7th International Conference on Intelligent Computing and Signal Processing (ICSP) 2023-04-21

The problem of missing modalities is both critical and non-trivial to be handled in multi-modal models. It common for tasks that certain contribute more compared other modalities, if those important are missing, the model performance drops significantly. Such fact remains unexplored by current approaches recover representation from feature reconstruction or blind aggregation instead extracting useful information best performing modalities. In this paper, we propose a Learnable Cross-modal...

10.48550/arxiv.2310.01035 preprint EN other-oa arXiv (Cornell University) 2023-01-01

The new generation of organic light emitting diode display is designed to enable the high dynamic range (HDR), going beyond standard (SDR) supported by traditional devices. However, a large quantity videos are still SDR format. Further, most pre-existing compressed at varying degrees for minimizing storage and traffic flow demands. To movie-going experience on devices, converting HDR format (i.e., <italic xmlns:mml="http://www.w3.org/1998/Math/MathML"...

10.1109/tpami.2023.3346921 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2023-12-25

We design a system for risk-analyzing and pricing portfolios of non-performing consumer credit loans. The rapid development lending business consumers heightens the need trading formed by overdue loans as manner risk transferring. However, problem is nontrivial technically related research absent. tackle challenge building bottom-up architecture, in which we model distribution every single loan's repayment rate, followed modeling portfolio's overall rate. To address technical issues...

10.48550/arxiv.2110.15102 preprint EN cc-by-nc-nd arXiv (Cornell University) 2021-01-01

Current deep image super-resolution (SR) approaches attempt to restore high-resolution images from down-sampled or by assuming degradation simple Gaussian kernels and additive noises. However, such processing techniques represent crude approximations of the real-world procedure lowering resolution. In this paper, we propose a more realistic process lower resolution introducing new Kernel Adversarial Learning Super-resolution (KASR) framework deal with SR problem. proposed framework, noises...

10.48550/arxiv.2104.09008 preprint EN other-oa arXiv (Cornell University) 2021-01-01
Coming Soon ...