NFDI4DS | UHH-SEMS - Publication Details

Hang Chen

ORCID: 0000-0002-0904-8946

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5029478820

Research Areas

Speech and Audio Processing
Speech Recognition and Synthesis
Advanced Adaptive Filtering Techniques
Music and Audio Processing
Image and Signal Denoising Methods
Infrastructure Maintenance and Monitoring
Multilingual Education and Policy
Photovoltaic System Optimization Techniques
3D Surveying and Cultural Heritage
Hearing Loss and Rehabilitation
Emotion and Mood Recognition
Solar Thermal and Photovoltaic Systems
Speech and dialogue systems
Solar Radiation and Photovoltaics
Geotechnical Engineering and Analysis
Second Language Learning and Teaching
EFL/ESL Teaching and Learning

University of Science and Technology of China
2021-2025

Assessment of Power Loss Caused by Soiling PV Modules Using a Dual Branch Multi-Modality Deep Learning Network Framework

OPENALEX - Publications

Peijie Lin Hang Chen Shuying Cheng Xiaoyang Lu Yaohai Lin and 1 more

10.1016/j.renene.2025.122926 article EN Renewable Energy 2025-04-01

HPCNet: Hybrid Pixel and Contour Network for Audio-Visual Speech Enhancement with Low-Quality Video

OPENALEX - Publications

Hang Chen Chen-Yue Zhang Qing Wang Jun Du Sabato Marco Siniscalchi and 2 more

10.1109/jstsp.2025.3559763 article EN IEEE Journal of Selected Topics in Signal Processing 2025-01-01

Cross-attention among spectrum, waveform and SSL representations with bidirectional knowledge distillation for speech enhancement

OPENALEX - Publications

Hang Chen Chenxi Wang Qing Wang Jun Du Sabato Marco Siniscalchi and 3 more

10.1016/j.inffus.2025.103218 article EN Information Fusion 2025-04-01

Evaluating feature contribution to wall deflection of braced excavation using interpretable XGBoost-SHAP model

OPENALEX - Publications

Yadong Liu Xian Liu Hesong Hu Hang Chen Shengfang Qiao

10.1117/12.3062704 article EN 2025-04-25

Correlating subword articulation with lip shapes for embedding aware audio-visual speech enhancement

OPENALEX - Publications

Hang Chen Jun Du Yu Hu Li-Rong Dai Baocai Yin and 1 more

10.1016/j.neunet.2021.06.003 article EN Neural Networks 2021-06-07

Collaborative Viseme Subword and End-to-End Modeling for Word-Level Lip Reading

OPENALEX - Publications

Hang Chen Qing Wang Jun Du Genshun Wan Xiong Shifu and 3 more

We propose a viseme subword modeling (VSM) approach to improve the generalizability and interpretability capabilities of deep neural network based lip reading. A comprehensive analysis preliminary experimental results reveals complementary nature conventional end-to-end (E2E) proposed VSM frameworks, especially concerning speaker head movements. To increase reading accuracy, we hybrid subwords (HVSEM), which exploits strengths both approaches through multitask learning. As an extension...

10.1109/tmm.2024.3390148 article EN IEEE Transactions on Multimedia 2024-01-01

Optimizing Audio-Visual Speech Enhancement Using Multi-Level Distortion Measures for Audio-Visual Speech Recognition

OPENALEX - Publications

Hang Chen Qing Wang Jun Du Baocai Yin Jia Pan and 1 more

A multi-level distortion measure (MLDM) is proposed as an objective to optimize deep neural network-based speech enhancement (SE) in both audio-only and audio-visual scenarios. The aim achieve simultaneous performance improvements quality, intelligibility, recognition error reductions. Moreover, a comprehensive correlation analysis shows that these three evaluation metrics exhibit high Pearson coefficient (PCC) values with commonly used optimization objectives: the mean squared between ideal...

10.1109/taslp.2024.3393732 article EN IEEE/ACM Transactions on Audio Speech and Language Processing 2024-01-01

Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023

OPENALEX - Publications

Haotian Wang Yuxuan Xi Hang Chen Jun Du Yan Song and 9 more

In this paper, we propose a novel framework for recognizing both discrete and dimensional emotions. our framework, deep features extracted from foundation models are used as robust acoustic visual representations of raw video. Three different structures based on attention-guided feature gathering (AFG) designed fusion. Then, introduce joint decoding structure emotion classification valence regression in the stage. A multi-task loss uncertainty is also to optimize whole process. Finally, by...

10.1145/3581783.3612859 preprint EN 2023-10-26

Unveiling Language Skills under Circuits

OPENALEX - Publications

Hang Chen Jiaying Zhu Xinyu Yang Wenya Wang

The exploration of language skills in models (LMs) has always been one the central goals mechanistic interpretability. However, existing circuit analyses often fall short representing full functional scope these models, primarily due to exclusion Feed-Forward layers. Additionally, isolating effect a single skill from text, which inherently involves multiple entangled skills, poses significant challenge. To address gaps, we introduce novel concept, Memory Circuit, minimum unit that fully and...

10.48550/arxiv.2410.01334 preprint EN arXiv (Cornell University) 2024-10-02

Layer-Adaptive Low-Rank Adaptation of Large ASR Model for Low-Resource Multilingual Scenarios

OPENALEX - Publications

Yi Han Hang Chen Jun Du Changqing Kong Shifu Xiong and 1 more

10.1109/iscslp63861.2024.10800407 article EN 2024-11-07

Space-and-speaker-aware acoustic modeling with effective data augmentation for recognition of multi-array conversational speech

OPENALEX - Publications

Li Chai Hang Chen Jun Du Qingfeng Liu Chin‐Hui Lee

10.1016/j.specom.2023.102958 article EN Speech Communication 2023-07-14

Coming Soon ...