NFDI4DS | UHH-SEMS - Publication Details

Hu Wang

ORCID: 0000-0003-0965-1590

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5100401029

Research Areas

Multimodal Machine Learning Applications
Domain Adaptation and Few-Shot Learning
Advanced Neural Network Applications
Advanced Image Processing Techniques
Image Enhancement Techniques
Advanced Vision and Imaging
Advanced Image and Video Retrieval Techniques
Music and Audio Processing
Speech and Audio Processing
Video Surveillance and Tracking Methods
Digital Media Forensic Detection
Advanced Optical Sensing Technologies
Radiomics and Machine Learning in Medical Imaging
Image Processing Techniques and Applications
Optical measurement and interference techniques
Medical Image Segmentation Techniques
Robotics and Sensor-Based Localization
Inertial Sensor and Navigation
Financial Distress and Bankruptcy Prediction
Medical Imaging and Analysis
Optical Systems and Laser Technology
Chaos-based Image/Signal Encryption
IoT-based Smart Home Systems
Seismology and Earthquake Studies
Face recognition and analysis

Mohamed bin Zayed University of Artificial Intelligence
2024

University of Electronic Science and Technology of China
2022-2023

The University of Adelaide
2023

Xinjiang University
2023

Shanghai University of Engineering Science
2022

China Electronics Technology Group Corporation
2018

Tianjin Huanhu Hospital
2018

Xi'an Institute of Optics and Precision Mechanics
2016

Multi-Modal Learning with Missing Modality via Shared-Specific Feature Modelling

OPENALEX - Publications

Hu Wang Yuanhong Chen Congbo Ma Jodie Avery M. Louise Hull and 1 more

The missing modality issue is critical but non-trivial to be solved by multi-modal models. Current methods aiming handle the problem in tasks, either deal with modalities only during evaluation or train separate models specific settings. In addition, these are designed for so example, classification not easily adapted segmentation tasks and vice versa. this paper, we propose Shared-Specific Feature Modelling (ShaSpec) method that considerably simpler more effective than competing approaches...

10.1109/cvpr52729.2023.01524 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Single-image HDR reconstruction by dual learning the camera imaging process

OPENALEX - Publications

She Lei Mao Ye Shuai Li Yu Zhao Ce Zhu and 1 more

10.1016/j.engappai.2023.105947 article EN Engineering Applications of Artificial Intelligence 2023-02-09

KUNet: Imaging Knowledge-Inspired Single HDR Image Reconstruction

OPENALEX - Publications

Hu Wang Mao Ye Xiatian Zhu Shuai Li Ce Zhu and 1 more

Recently, with the rise of high dynamic range (HDR) display devices, there is a great demand to transfer traditional low (LDR) images into HDR versions. The key success how solve many-to-many mapping problem. However, existing approaches either do not consider constraining solution space or just simply imitate inverse camera imaging pipeline in stages, without directly formulating image generation process. In this work, we address problem by integrating LDR-to-HDR knowledge an UNet...

10.24963/ijcai.2022/196 article EN Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence 2022-07-01

ReFs: A hybrid pre-training paradigm for 3D medical image segmentation

OPENALEX - Publications

Yutong Xie Jianpeng Zhang Lingqiao Liu Hu Wang Yiwen Ye and 2 more

10.1016/j.media.2023.103023 article EN Medical Image Analysis 2023-11-08

ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation

OPENALEX - Publications

Yuyuan Liu Yuanhong Chen Hu Wang Vasileios Belagiannis Ian Reid and 1 more

The costly and time-consuming annotation process to produce large training sets for modelling semantic LiDAR segmentation methods has motivated the development of semi-supervised learning (SSL) methods. However, such SSL approaches often concentrate on employing consistency only individual representations. This narrow focus results in limited perturbations that generally fail enable effective learning. Additionally, these employ contrastive based sampling from a set positive negative...

10.48550/arxiv.2407.07171 preprint EN arXiv (Cornell University) 2024-07-09

CPM: Class-conditional Prompting Machine for Audio-visual Segmentation

OPENALEX - Publications

Yuanhong Chen Chong Wang Yuyuan Liu Hu Wang Gustavo Carneiro

Audio-visual segmentation (AVS) is an emerging task that aims to accurately segment sounding objects based on audio-visual cues. The success of AVS learning systems depends the effectiveness cross-modal interaction. Such a requirement can be naturally fulfilled by leveraging transformer-based architecture due its inherent ability capture long-range dependencies and flexibility in handling different modalities. However, training issues methods, such as low efficacy cross-attention unstable...

10.48550/arxiv.2407.05358 preprint EN arXiv (Cornell University) 2024-07-07

Intelligent lecture recording system based on coordination of face-detection and pedestrian dead reckoning

OPENALEX - Publications

Hu Wang Jianpeng Hu

Automatic lecture recording is an appealing alternative approach to manually lectures in the process of online course making as it can a large extent save labor cost. The key automatic system lecturer tracking, and existing tracking methods tend lose target case lecturer's rapid movement. This article proposes based on MobileNet-SSD face detection Pedestrian Dead Reckoning (PDR) technology solve this problem. First, particle filter algorithm used fuse PDR information with rotation angle...

10.7717/peerj-cs.971 article EN cc-by PeerJ Computer Science 2022-05-17

The Design and Research of Data Transmission on Remote Radio Control in Different Noise Channel

OPENALEX - Publications

Hongqiang Li Rui-lin Ju Hu Wang Y. Sun

Previously a novel chaotic transmission method was proposed in air traffic voice communication system, which applied remote radio control transmission. It has better performance than Frequency Shift Keying efficiency, bandwidth and anti-additive-white-noise performance. However, the channel noise great effect on method. Thus problem of modulated signals affected by how to design M-ary system additive color Gauss for data are studied this paper. Firstly, signals' boundary range white is...

10.1109/wcica.2018.8630340 article EN 2018-07-01

Method used to test the imaging consistency of binocular camera's left-right optical system

OPENALEX - Publications

Meiying Liu Hu Wang Jie Liu Yaoke Xue Shaodong Yang and 1 more

To binocular camera, the consistency of optical parameters left and right system is an important factor that will influence overall imaging consistency. In conventional testing procedure system, there lacks specifications suitable for evaluating this paper, considering special requirements a method used to measure camera presented. Based on method, measurement which composed integrating sphere, rotary table CMOS has been established. First, let capture images in normal exposure time under...

10.1117/12.2242265 article EN Proceedings of SPIE, the International Society for Optical Engineering/Proceedings of SPIE 2016-09-27

A Closer Look at Audio-Visual Segmentation

OPENALEX - Publications

Yuanhong Chen Yuyuan Liu Hu Wang Fengbei Liu Chong Wang and 1 more

Audio-visual segmentation (AVS) is a challenging task that involves accurately segmenting sounding objects based on audio-visual cues. The effectiveness of learning critically depends achieving accurate cross-modal alignment between sound and visual objects. Successful requires two essential components: 1) dataset with high-quality pixel-level multi-class annotated images associated audio files, 2) model can establish strong links information its corresponding object. However, these...

10.48550/arxiv.2304.02970 preprint EN cc-by-nc-sa arXiv (Cornell University) 2023-01-01

Kernel Adversarial Learning for Real-world Image Super-resolution

OPENALEX - Publications

Hu Wang Congbo Ma Jianpeng Zhang Wei Emma Zhang Gustavo Carneiro

Abstract Current deep image super-resolution (SR) approaches aim to restore high-resolution images from down-sampled or by assuming degradation simple Gaussian kernels and additive noises. However, these techniques only assume crude approximations of the real-world process, which should involve complex noise patterns that are difficult model using assumptions. In this paper, we propose a more realistic process synthesise low-resolution for SR introducing new Kernel Adversarial Learning...

10.21203/rs.3.rs-3183618/v1 preprint EN cc-by Research Square (Research Square) 2023-07-24

Multi-modal Learning with Missing Modality via Shared-Specific Feature Modelling

OPENALEX - Publications

Hu Wang Yuanhong Chen Congbo Ma Jodie Avery M. Louise Hull and 1 more

10.48550/arxiv.2307.14126 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Real-time Lightweight Detection Model on Optimized YOLOv7: Applied to the UAVs in Urban Environments

OPENALEX - Publications

Wei Feng Jianping Zhou Xiang Zhi Tan Jing Lin Hu Wang

Considering the conditional constraints in urban environment, traditional method of detecting drones is not applicable, and combining target detection with it an effective solution. We propose a lightweight object model based on YOLOv7. The replaces Backbone network MobileViTv3, which reduces number parameters model; Secondly, SPD-Conv module introduced to increase accuracy small-sized drones; In addition, we integrate SimAM Module (Simple, Parameter-Free Attention Module) find important...

10.1109/icsp58490.2023.10248550 article EN 2022 7th International Conference on Intelligent Computing and Signal Processing (ICSP) 2023-04-21

Learnable Cross-modal Knowledge Distillation for Multi-modal Learning with Missing Modality

OPENALEX - Publications

Hu Wang Yuanhong Chen Congbo Ma Jodie Avery M. Louise Hull and 1 more

The problem of missing modalities is both critical and non-trivial to be handled in multi-modal models. It common for tasks that certain contribute more compared other modalities, if those important are missing, the model performance drops significantly. Such fact remains unexplored by current approaches recover representation from feature reconstruction or blind aggregation instead extracting useful information best performing modalities. In this paper, we propose a Learnable Cross-modal...

10.48550/arxiv.2310.01035 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Compressed-SDR to HDR Video Reconstruction

OPENALEX - Publications

Hu Wang Mao Ye Xiatian Zhu Shuai Li Xue Li and 1 more

The new generation of organic light emitting diode display is designed to enable the high dynamic range (HDR), going beyond standard (SDR) supported by traditional devices. However, a large quantity videos are still SDR format. Further, most pre-existing compressed at varying degrees for minimizing storage and traffic flow demands. To movie-going experience on devices, converting HDR format (i.e., <italic xmlns:mml="http://www.w3.org/1998/Math/MathML"...

10.1109/tpami.2023.3346921 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2023-12-25

Co-seismic Displacement of the 25 April 2015 Nepal Ms8.1 Earthquake Effects on the China's Mount Everest Area Derived from GNSS Data Using the PPP Network Solution by UPD Ambiguity Fixed Technology

OPENALEX - Publications

Hu Wang Jiancheng Li Dang Yamin Yingyan Cheng Jiexian Wang and 3 more

10.11947/j.agcs.2016.f036 article EN DOAJ (DOAJ: Directory of Open Access Journals) 2017-05-01

Risk and return prediction for pricing portfolios of non-performing consumer credit

OPENALEX - Publications

Siyi Wang Yan Xing Bangqi Zheng Hu Wang Wangli Xu and 2 more

We design a system for risk-analyzing and pricing portfolios of non-performing consumer credit loans. The rapid development lending business consumers heightens the need trading formed by overdue loans as manner risk transferring. However, problem is nontrivial technically related research absent. tackle challenge building bottom-up architecture, in which we model distribution every single loan's repayment rate, followed modeling portfolio's overall rate. To address technical issues...

10.48550/arxiv.2110.15102 preprint EN cc-by-nc-nd arXiv (Cornell University) 2021-01-01

Kernel Adversarial Learning for Real-world Image Super-resolution

OPENALEX - Publications

Hu Wang Congbo Ma Jianpeng Zhang Gustavo Carneiro

Current deep image super-resolution (SR) approaches attempt to restore high-resolution images from down-sampled or by assuming degradation simple Gaussian kernels and additive noises. However, such processing techniques represent crude approximations of the real-world procedure lowering resolution. In this paper, we propose a more realistic process lower resolution introducing new Kernel Adversarial Learning Super-resolution (KASR) framework deal with SR problem. proposed framework, noises...

10.48550/arxiv.2104.09008 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Coming Soon ...