Yao Wang

ORCID: 0009-0001-4577-7708
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Speech and Audio Processing
  • Blind Source Separation Techniques
  • Music and Audio Processing
  • Text and Document Classification Technologies
  • Neural Networks and Applications
  • Speech Recognition and Synthesis
  • Neural dynamics and brain function
  • Topic Modeling
  • Neuroscience and Music Perception
  • Natural Language Processing Techniques
  • EEG and Brain-Computer Interfaces

New York University
2018-2024

Japan Advanced Institute of Science and Technology
2022-2024

University of Tsukuba
2024

When we vocalize, our brain distinguishes self-generated sounds from external ones. A corollary discharge signal supports this function in animals; however, humans, its exact origin and temporal dynamics remain unknown. We report electrocorticographic recordings neurosurgical patients a connectivity analysis framework based on Granger causality that reveals major neural communications. find reproducible source for across multiple speech production paradigms localized to the ventral motor...

10.1073/pnas.2404121121 article EN Proceedings of the National Academy of Sciences 2024-12-03

The superior temporal gyrus (STG) region of cortex critically contributes to speech recognition. In this work, we show that a proposed WaveNet, with limited available data, is able reconstruct stimuli from STG intracranial recordings. We further investigate the impulse response fitted model for each recording electrode and observe phoneme level temporospectral tuning properties recorded area cortex. This discovery consistent previous studies implicating posterior (pSTG) in phonetic...

10.1109/spmb.2018.8615605 article EN 2018-12-01

Fake audio detection (FAD) aims to detect fake speech generated by advanced voice conversion and text-to-speech technologies. Recently, the quality of synthesized has significantly improved due remarkable development deep neural networks. However, it is still easy for humans identify perceiving pathological prosody in a voice. Pathological related amplitude frequency perturbation (AFP) provides essential cues speech. This paper proposed analyze AFP differences using jitter shimmer features....

10.23919/apsipaasc55919.2022.9980028 article EN 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) 2022-11-07

Named Entity Recognition and Relation Extraction are two crucial challenging subtasks in Information Extraction. Despite the successes achieved by traditional approaches, fundamental research questions remain open. First, most recent studies use parameter sharing for a single subtask or shared features both subtasks, ignoring their semantic differences. Second, information interaction mainly focuses on leaving fine-grained among subtask-specific of encoding subjects, relations, objects...

10.1109/access.2024.3420877 article EN cc-by-nc-nd IEEE Access 2024-01-01

The superior temporal gyrus (STG) region of cortex critically contributes to speech recognition. In this work, we show that a proposed WaveNet, with limited available data, is able reconstruct stimuli from STG intracranial recordings. We further investigate the impulse response fitted model for each recording electrode and observe phoneme level temporospectral tuning properties recorded area cortex. This discovery consistent previous studies implicating posterior (pSTG) in phonetic...

10.48550/arxiv.1811.02694 preprint EN other-oa arXiv (Cornell University) 2018-01-01
Coming Soon ...