NFDI4DS | UHH-SEMS - Publication Details

Wenzhe Liu

ORCID: 0000-0002-0827-6883

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5100679522

Research Areas

Speech and Audio Processing
Advanced Adaptive Filtering Techniques
Speech Recognition and Synthesis
Music and Audio Processing
Hearing Loss and Rehabilitation
Advanced Battery Materials and Technologies
Advancements in Battery Materials
Indoor and Outdoor Localization Technologies
Supercapacitor Materials and Fabrication
Structural Health Monitoring Techniques
Advanced Battery Technologies Research
Analytical chemistry methods development
Higher Education and Teaching Methods
Acoustic Wave Phenomena Research
Infant Health and Development
Analytical Chemistry and Sensors
Electrochemical Analysis and Applications
Emotion and Mood Recognition

Chinese Academy of Sciences
2021-2024

University of Chinese Academy of Sciences
2021-2024

Beijing National Laboratory for Molecular Sciences
2023-2024

Institute of Acoustics
2021-2023

Tencent (China)
2023

Harbin Institute of Technology
2023

Dalian University of Technology
2023

Xi'an Aeronautical University
2006

Two Heads are Better Than One: A Two-Stage Complex Spectral Mapping Approach for Monaural Speech Enhancement

OPENALEX - Publications

Andong Li Wenzhe Liu Chengshi Zheng Cunhang Fan Xiaodong Li

For challenging acoustic scenarios as low signal-to-noise ratios, current speech enhancement systems usually suffer from performance bottleneck in extracting the target mixtures within one step. To address this issue, we propose a novel complex spectral mapping approach with two-stage pipeline for monaural time-frequency domain. The proposed algorithm aims to decouple primal problem into multiple sub-problems, which follows classic proverb, "two heads are better than one". More specifically,...

10.1109/taslp.2021.3079813 article EN IEEE/ACM Transactions on Audio Speech and Language Processing 2021-01-01

Chemical‐Mechanical Robustness of Single‐Crystalline Ni‐Rich Cathode Enabled by Surface Atomic Arrangement Control

OPENALEX - Publications

Xin‐Hai Meng Xu‐Dong Zhang Hang Sheng Min Fan Ting Lin and 7 more

Layered transition metal oxide cathodes have been one of the dominant for lithium-ion batteries with efficient Li+ intercalation chemistry. However, limited by weak layered interaction and unstable surface, mechanical chemical failure plagues their electrochemical performance, especially Ni-rich cathodes. Here, adopting a simultaneous elemental-structural atomic arrangement control based on intrinsic Ni-Co-Mn system, surface role is intensively investigated. Within invariant oxygen...

10.1002/anie.202302170 article EN Angewandte Chemie International Edition 2023-04-01

A Simultaneous Denoising and Dereverberation Framework with Target Decoupling

OPENALEX - Publications

Andong Li Wenzhe Liu Xiaoxue Luo Guochen Yu Chengshi Zheng and 1 more

Background noise and room reverberation are regarded as two major factors to degrade the subjective speech quality.In this paper, we propose an integrated framework address simultaneous denoising dereverberation under complicated scenario environments.It adopts a chain optimization strategy designs four sub-stages accordingly.In first stages, decouple multi-task learning w.r.t.complex spectrum into magnitude phase, only implement removal in domain.Based on estimated priors above, further...

10.21437/interspeech.2021-1137 article EN Interspeech 2022 2021-08-27

Self-Limiting Phase Transition Enabling Reversible Overstoichiometric Li Storage in Ni-Rich Cathodes

OPENALEX - Publications

Xin‐Hai Meng Dongdong Xiao Ziyi Zhou Wenzhe Liu Ji‐Lei Shi and 2 more

Ni-rich cathodes are some of the most promising candidates for advanced lithium-ion batteries, but their available capacities have been stagnant due to intrinsic Li

10.1021/jacs.4c04756 article EN Journal of the American Chemical Society 2024-05-15

ICASSP 2021 Deep Noise Suppression Challenge: Decoupling Magnitude and Phase Optimization with a Two-Stage Deep Network

OPENALEX - Publications

Andong Li Wenzhe Liu Xiaoxue Luo Chengshi Zheng Xiaodong Li

It remains a tough challenge to recover the speech signals contaminated by various noises under real acoustic environments. To this end, we propose novel system for denoising in complicated applications, which is mainly comprised of two pipelines, namely two-stage network and post-processing module. The first pipeline proposed decouple optimization problem w.r.t. magnitude phase, i.e., only estimated stage both them are further refined second stage. aims suppress remaining unnatural...

10.1109/icassp39728.2021.9414062 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021-05-13

Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement

OPENALEX - Publications

Andong Li Wenzhe Liu Chengshi Zheng Xiaodong Li

Standing upon the intersection of traditional beamformers and deep neural networks, we propose a causal beamformer paradigm called Embedding Beamforming, two core modules are devised accordingly, namely EM BM. For EM, instead estimating spatial covariance matrix explicitly, 3-D embedding tensor is learned with network, where spatial-spectral discriminative information can be implicitly represented. BM, network directly leveraged to derive beamforming weights so as implement filter-and-sum...

10.1109/icassp43922.2022.9746432 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022-04-27

A General Unfolding Speech Enhancement Method Motivated by Taylor's Theorem

OPENALEX - Publications

Andong Li Guochen Yu Chengshi Zheng Wenzhe Liu Xiaodong Li

While deep neural networks have facilitated significant advancements in the field of speech enhancement, most existing methods are developed following either empirical or relatively blind criteria, lacking adequate guidelines pipeline design. Inspired by Taylor's theorem, we propose a general unfolding framework for both single- and multi-channel enhancement tasks. Concretely, formulate complex spectrum recovery into spectral magnitude mapping neighborhood space noisy mixture, which an...

10.1109/taslp.2023.3313442 article EN IEEE/ACM Transactions on Audio Speech and Language Processing 2023-01-01

Iodine revisited: If and how inorganic iodine species can be measured reliably and what cause their conversions in water?

OPENALEX - Publications

Huimei Pan Boqiang Li Jie Yang Wenzhe Liu Wang Luo and 1 more

10.1016/j.jhazmat.2023.132423 article EN Journal of Hazardous Materials 2023-08-28

Gesper: A Unified Framework for General Speech Restoration

OPENALEX - Publications

Jun Chen Yupeng Shi Wenzhe Liu Wei Rao Shulin He and 5 more

This paper describes the legends-tencent team's real-time General Speech Restoration (Gesper) system submitted to ICASSP 2023 Signal Improvement (SSI) Challenge. newly proposed is a two-stage architecture, in which speech restoration performed, and then followed by enhancement. We propose complex spectral mapping-based generative adversarial network (CSM-GAN) as module for first time. For noise suppression dereverberation, enhancement presented with fullband-wideband parallel processing. On...

10.1109/icassp49357.2023.10095557 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023-05-05

Low-latency monaural speech enhancement with deep filter-bank equalizer

OPENALEX - Publications

Chengshi Zheng Wenzhe Liu Andong Li Yuxuan Ke Xiaodong Li

It is highly desirable that speech enhancement algorithms can achieve good performance while keeping low latency for many applications, such as digital hearing aids, mobile phones, acoustically transparent devices, and public address systems. To improve the of traditional low-latency algorithms, a deep filter-bank equalizer (FBE) framework was proposed integrated learning-based subband noise reduction network with shortened filter mapping network. In first network, learning model trained...

10.1121/10.0011396 article EN The Journal of the Acoustical Society of America 2022-05-01

A Neural Beamspace-Domain Filter for Real-Time Multi-Channel Speech Enhancement

OPENALEX - Publications

Wenzhe Liu Andong Li Xiao Wang Minmin Yuan Yi Chen and 2 more

Most deep-learning-based multi-channel speech enhancement methods focus on designing a set of beamforming coefficients, to directly filter the low signal-to-noise ratio signals received by microphones, which hinders performance these approaches. To handle problems, this paper designs causal neural that fully exploits spectro-temporal-spatial information in beamspace domain. Specifically, multiple beams are designed steer towards all directions, using parameterized super-directive beamformer...

10.3390/sym14061081 article EN Symmetry 2022-05-24

FSI-Net: A dual-stage full- and sub-band integration network for full-band speech enhancement

OPENALEX - Publications

Guochen Yu Hui Wang Andong Li Wenzhe Liu Yuan Zhang and 2 more

10.1016/j.apacoust.2023.109539 article EN Applied Acoustics 2023-07-26

A separation and interaction framework for causal multi-channel speech enhancement

OPENALEX - Publications

Wenzhe Liu Andong Li Chengshi Zheng Xiaodong Li

10.1016/j.dsp.2022.103519 article EN Digital Signal Processing 2022-03-10

Chemical‐Mechanical Robustness of Single‐Crystalline Ni‐Rich Cathode Enabled by Surface Atomic Arrangement Control

OPENALEX - Publications

Xin‐Hai Meng Xu‐Dong Zhang Hang Sheng Min Fan Ting Lin and 7 more

Abstract Layered transition metal oxide cathodes have been one of the dominant for lithium‐ion batteries with efficient Li + intercalation chemistry. However, limited by weak layered interaction and unstable surface, mechanical chemical failure plagues their electrochemical performance, especially Ni‐rich cathodes. Here, adopting a simultaneous elemental‐structural atomic arrangement control based on intrinsic Ni−Co−Mn system, surface role is intensively investigated. Within invariant oxygen...

10.1002/ange.202302170 article EN Angewandte Chemie 2023-04-01

A Primary task driven adaptive loss function for multi-task speech emotion recognition

OPENALEX - Publications

Luyao Liu Wenzhe Liu Lin Feng

10.1016/j.engappai.2023.107286 article EN Engineering Applications of Artificial Intelligence 2023-10-21

Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Full-Band Speech Enhancement

OPENALEX - Publications

Guochen Yu Andong Li Wenzhe Liu Chengshi Zheng Yutian Wang and 1 more

Due to the high computational complexity model more frequency bands, it is still intractable conduct full-band speech enhancement based on deep neural networks. Recent studies typically utilize compressed perceptually motivated features with relatively low resolution filter spectrum by one-stage networks, leading limited quality improvements. In this paper, we propose a coordinated sub-band fusion network for enhancement, which aims recover low- (0-8kHz), middle- (8-16kHz), and high-band...

10.1109/iscslp57327.2022.10037937 article EN 2022 13th International Symposium on Chinese Spoken Language Processing (ISCSLP) 2022-12-11

Gesper: A Restoration-Enhancement Framework for General Speech Reconstruction

OPENALEX - Publications

Wenzhe Liu Yupeng Shi Jun Chen Wei Rao Shulin He and 3 more

10.21437/interspeech.2023-1511 article EN Interspeech 2022 2023-08-14

ICASSP 2021 Deep Noise Suppression Challenge: Decoupling Magnitude and Phase Optimization with a Two-Stage Deep Network

OPENALEX - Publications

Andong Li Wenzhe Liu Xiaoxue Luo Chengshi Zheng Xiaodong Li

It remains a tough challenge to recover the speech signals contaminated by various noises under real acoustic environments. To this end, we propose novel system for denoising in complicated applications, which is mainly comprised of two pipelines, namely two-stage network and post-processing module. The first pipeline proposed decouple optimization problem w:r:t: magnitude phase, i.e., only estimated stage both them are further refined second stage. aims suppress remaining unnatural...

10.48550/arxiv.2102.04198 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Know Your Enemy, Know Yourself: A Unified Two-Stage Framework for Speech Enhancement

OPENALEX - Publications

Wenzhe Liu Andong Li Yuxuan Ke Chengshi Zheng Xiaodong Li

10.21437/interspeech.2021-238 article EN Interspeech 2022 2021-08-27

TaylorBeamixer: Learning Taylor-Inspired All-Neural Multi-Channel Speech Enhancement from Beam-Space Dictionary Perspective

OPENALEX - Publications

Andong Li Weixin Meng Guochen Yu Wenzhe Liu Xiaodong Li and 1 more

10.21437/interspeech.2023-514 article EN Interspeech 2022 2023-08-14

Multi-mode Neural Speech Coding Based on Deep Generative Networks

OPENALEX - Publications

Wei Xiao Wenzhe Liu Meng Wang Shan Yang Yupeng Shi and 4 more

10.21437/interspeech.2023-1490 article EN Interspeech 2022 2023-08-14

A Neural Beam Filter for Real-time Multi-channel Speech Enhancement

OPENALEX - Publications

Wenzhe Liu Andong Li Chengshi Zheng Xiaodong Li

Most deep learning-based multi-channel speech enhancement methods focus on designing a set of beamforming coefficients to directly filter the low signal-to-noise ratio signals received by microphones, which hinders performance these approaches. To handle problems, this paper designs causal neural beam that fully exploits spatial-spectral information in domain. Specifically, multiple beams are designed steer towards all directions using parameterized super-directive beamformer first stage....

10.48550/arxiv.2202.02500 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement

OPENALEX - Publications

Guochen Yu Andong Li Wenzhe Liu Chengshi Zheng Yutian Wang and 1 more

Due to the high computational complexity model more frequency bands, it is still intractable conduct real-time full-band speech enhancement based on deep neural networks. Recent studies typically utilize compressed perceptually motivated features with relatively low resolution filter spectrum by one-stage networks, leading limited quality improvements. In this paper, we propose a coordinated sub-band fusion network for enhancement, which aims recover low- (0-8 kHz), middle- (8-16 and...

10.48550/arxiv.2203.16033 preprint EN cc-by arXiv (Cornell University) 2022-01-01

Embedding and Beamforming: All-neural Causal Beamformer for Multichannel Speech Enhancement

OPENALEX - Publications

Andong Li Wenzhe Liu Chengshi Zheng Xiaodong Li

The spatial covariance matrix has been considered to be significant for beamformers. Standing upon the intersection of traditional beamformers and deep neural networks, we propose a causal beamformer paradigm called Embedding Beamforming, two core modules are designed accordingly, namely EM BM. For EM, instead estimating explicitly, 3-D embedding tensor is learned with network, where both spectral discriminative information can represented. BM, network directly leveraged derive beamforming...

10.48550/arxiv.2109.00265 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Coming Soon ...