Wenzhe Liu

ORCID: 0000-0002-0827-6883
Research Areas
  • Speech and Audio Processing
  • Advanced Adaptive Filtering Techniques
  • Speech Recognition and Synthesis
  • Music and Audio Processing
  • Hearing Loss and Rehabilitation
  • Advanced Battery Materials and Technologies
  • Advancements in Battery Materials
  • Indoor and Outdoor Localization Technologies
  • Supercapacitor Materials and Fabrication
  • Structural Health Monitoring Techniques
  • Advanced Battery Technologies Research
  • Analytical Chemistry Methods Development
  • Higher Education and Teaching Methods
  • Acoustic Wave Phenomena Research
  • Infant Health and Development
  • Analytical Chemistry and Sensors
  • Electrochemical Analysis and Applications
  • Emotion and Mood Recognition

Chinese Academy of Sciences
2021-2024

University of Chinese Academy of Sciences
2021-2024

Beijing National Laboratory for Molecular Sciences
2023-2024

Institute of Acoustics
2021-2023

Tencent (China)
2023

Harbin Institute of Technology
2023

Dalian University of Technology
2023

Xi'an Aeronautical University
2006

For challenging acoustic scenarios as low signal-to-noise ratios, current speech enhancement systems usually suffer from performance bottleneck in extracting the target mixtures within one step. To address this issue, we propose a novel complex spectral mapping approach with two-stage pipeline for monaural time-frequency domain. The proposed algorithm aims to decouple primal problem into multiple sub-problems, which follows classic proverb, "two heads are better than one". More specifically,...

10.1109/taslp.2021.3079813 article EN IEEE/ACM Transactions on Audio Speech and Language Processing 2021-01-01

Layered transition metal oxide cathodes have been one of the dominant for lithium-ion batteries with efficient Li+ intercalation chemistry. However, limited by weak layered interaction and unstable surface, mechanical chemical failure plagues their electrochemical performance, especially Ni-rich cathodes. Here, adopting a simultaneous elemental-structural atomic arrangement control based on intrinsic Ni-Co-Mn system, surface role is intensively investigated. Within invariant oxygen...

10.1002/anie.202302170 article EN Angewandte Chemie International Edition 2023-04-01

Background noise and room reverberation are regarded as two major factors to degrade the subjective speech quality. In this paper, we propose an integrated framework address simultaneous denoising dereverberation under complicated scenario environments. It adopts a chain optimization strategy designs four sub-stages accordingly. In first stages, decouple multi-task learning w.r.t. complex spectrum into magnitude phase, only implement removal in domain. Based on estimated priors above, further...

10.21437/interspeech.2021-1137 article EN Interspeech 2021 2021-08-27

Ni-rich cathodes are some of the most promising candidates for advanced lithium-ion batteries, but their available capacities have been stagnant due to intrinsic Li

10.1021/jacs.4c04756 article EN Journal of the American Chemical Society 2024-05-15

It remains a tough challenge to recover the speech signals contaminated by various noises under real acoustic environments. To this end, we propose novel system for denoising in complicated applications, which is mainly comprised of two pipelines, namely two-stage network and post-processing module. The first pipeline proposed decouple optimization problem w.r.t. magnitude phase, i.e., only estimated stage both them are further refined second stage. aims suppress remaining unnatural...

10.1109/icassp39728.2021.9414062 article EN ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021-05-13
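
As a rough illustration of the magnitude/phase decoupling described in the abstract above, the sketch below processes a single time-frequency bin in two steps: the first keeps the noisy phase and replaces only the magnitude, and the second applies a complex correction that changes magnitude and phase together. The function names `stage_one` and `stage_two`, the additive residual form, and all numeric values are assumptions made for illustration; in the paper both stages are learned networks, and this is not the authors' implementation.

```python
import cmath


def stage_one(noisy_bin: complex, est_magnitude: float) -> complex:
    """Coarse estimate: keep the noisy phase, replace only the magnitude."""
    return cmath.rect(est_magnitude, cmath.phase(noisy_bin))


def stage_two(coarse_bin: complex, residual: complex) -> complex:
    """Refinement: a complex correction adjusts magnitude and phase jointly."""
    return coarse_bin + residual


# One noisy time-frequency bin with magnitude 5 (hypothetical value).
noisy = complex(3.0, 4.0)
# Hypothetical first-stage magnitude estimate of 2.5.
coarse = stage_one(noisy, 2.5)
# Hypothetical second-stage complex residual (assumed additive form).
refined = stage_two(coarse, complex(0.1, -0.2))
```

After the first step the phase of `coarse` still equals the phase of `noisy`; only the second step moves it.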

Standing upon the intersection of traditional beamformers and deep neural networks, we propose a causal beamformer paradigm called Embedding Beamforming, two core modules are devised accordingly, namely EM BM. For EM, instead estimating spatial covariance matrix explicitly, 3-D embedding tensor is learned with network, where spatial-spectral discriminative information can be implicitly represented. BM, network directly leveraged to derive beamforming weights so as implement filter-and-sum...

10.1109/icassp43922.2022.9746432 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022-04-27
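
The filter-and-sum operation mentioned in the abstract above can be sketched for one time-frequency bin as follows. This is a minimal sketch, assuming the common convention y = sum over microphones of conj(w_m) * x_m; the function name `filter_and_sum` and the two-microphone values are hypothetical, and the weights, which the paper derives with a network, are fixed constants here.

```python
def filter_and_sum(weights, mic_bins):
    """Return y = sum_m conj(w_m) * x_m for one time-frequency bin."""
    if len(weights) != len(mic_bins):
        raise ValueError("need exactly one weight per microphone")
    return sum(w.conjugate() * x for w, x in zip(weights, mic_bins))


# Two hypothetical microphone observations of the same bin.
mic_bins = [complex(1.0, 1.0), complex(1.0, -1.0)]
# Hypothetical beamforming weights (simple averaging).
weights = [complex(0.5, 0.0), complex(0.5, 0.0)]
enhanced = filter_and_sum(weights, mic_bins)
```

With equal averaging weights, the imaginary parts of the two observations cancel and the real parts are averaged.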

While deep neural networks have facilitated significant advancements in the field of speech enhancement, most existing methods are developed following either empirical or relatively blind criteria, lacking adequate guidelines pipeline design. Inspired by Taylor's theorem, we propose a general unfolding framework for both single- and multi-channel enhancement tasks. Concretely, formulate complex spectrum recovery into spectral magnitude mapping neighborhood space noisy mixture, which an...

10.1109/taslp.2023.3313442 article EN IEEE/ACM Transactions on Audio Speech and Language Processing 2023-01-01

This paper describes the legends-tencent team's real-time General Speech Restoration (Gesper) system submitted to ICASSP 2023 Signal Improvement (SSI) Challenge. newly proposed is a two-stage architecture, in which speech restoration performed, and then followed by enhancement. We propose complex spectral mapping-based generative adversarial network (CSM-GAN) as module for first time. For noise suppression dereverberation, enhancement presented with fullband-wideband parallel processing. On...

10.1109/icassp49357.2023.10095557 article EN ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023-05-05

It is highly desirable that speech enhancement algorithms can achieve good performance while keeping low latency for many applications, such as digital hearing aids, mobile phones, acoustically transparent devices, and public address systems. To improve the of traditional low-latency algorithms, a deep filter-bank equalizer (FBE) framework was proposed integrated learning-based subband noise reduction network with shortened filter mapping network. In first network, learning model trained...

10.1121/10.0011396 article EN The Journal of the Acoustical Society of America 2022-05-01

Most deep-learning-based multi-channel speech enhancement methods focus on designing a set of beamforming coefficients, to directly filter the low signal-to-noise ratio signals received by microphones, which hinders performance these approaches. To handle problems, this paper designs causal neural that fully exploits spectro-temporal-spatial information in beamspace domain. Specifically, multiple beams are designed steer towards all directions, using parameterized super-directive beamformer...

10.3390/sym14061081 article EN Symmetry 2022-05-24

Layered transition metal oxide cathodes have been one of the dominant for lithium‐ion batteries with efficient Li+ intercalation chemistry. However, limited by weak layered interaction and unstable surface, mechanical chemical failure plagues their electrochemical performance, especially Ni‐rich cathodes. Here, adopting a simultaneous elemental‐structural atomic arrangement control based on intrinsic Ni−Co−Mn system, surface role is intensively investigated. Within invariant oxygen...

10.1002/ange.202302170 article EN Angewandte Chemie 2023-04-01

10.1016/j.engappai.2023.107286 article EN Engineering Applications of Artificial Intelligence 2023-10-21

Due to the high computational complexity model more frequency bands, it is still intractable conduct full-band speech enhancement based on deep neural networks. Recent studies typically utilize compressed perceptually motivated features with relatively low resolution filter spectrum by one-stage networks, leading limited quality improvements. In this paper, we propose a coordinated sub-band fusion network for enhancement, which aims recover low- (0-8kHz), middle- (8-16kHz), and high-band...

10.1109/iscslp57327.2022.10037937 article EN 2022 13th International Symposium on Chinese Spoken Language Processing (ISCSLP) 2022-12-11

It remains a tough challenge to recover the speech signals contaminated by various noises under real acoustic environments. To this end, we propose novel system for denoising in complicated applications, which is mainly comprised of two pipelines, namely two-stage network and post-processing module. The first pipeline proposed decouple optimization problem w.r.t. magnitude phase, i.e., only estimated stage both them are further refined second stage. aims suppress remaining unnatural...

10.48550/arxiv.2102.04198 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Most deep learning-based multi-channel speech enhancement methods focus on designing a set of beamforming coefficients to directly filter the low signal-to-noise ratio signals received by microphones, which hinders performance these approaches. To handle problems, this paper designs causal neural beam that fully exploits spatial-spectral information in domain. Specifically, multiple beams are designed steer towards all directions using parameterized super-directive beamformer first stage....

10.48550/arxiv.2202.02500 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Due to the high computational complexity model more frequency bands, it is still intractable conduct real-time full-band speech enhancement based on deep neural networks. Recent studies typically utilize compressed perceptually motivated features with relatively low resolution filter spectrum by one-stage networks, leading limited quality improvements. In this paper, we propose a coordinated sub-band fusion network for enhancement, which aims recover low- (0-8 kHz), middle- (8-16 and...

10.48550/arxiv.2203.16033 preprint EN cc-by arXiv (Cornell University) 2022-01-01

The spatial covariance matrix has been considered to be significant for beamformers. Standing upon the intersection of traditional beamformers and deep neural networks, we propose a causal beamformer paradigm called Embedding Beamforming, two core modules are designed accordingly, namely EM BM. For EM, instead estimating explicitly, 3-D embedding tensor is learned with network, where both spectral discriminative information can represented. BM, network directly leveraged derive beamforming...

10.48550/arxiv.2109.00265 preprint EN other-oa arXiv (Cornell University) 2021-01-01