NFDI4DS | UHH-SEMS - Publication Details

A corollary discharge circuit in human speech

OPENALEX - Publications

Amirhossein Khalilian-Gourtani Ran Wang Xupeng Chen Leyao Yu Patricia Dugan and 5 more

When we vocalize, our brain distinguishes self-generated sounds from external ones. A corollary discharge signal supports this function in animals; however, humans, its exact origin and temporal dynamics remain unknown. We report electrocorticographic recordings neurosurgical patients a connectivity analysis framework based on Granger causality that reveals major neural communications. find reproducible source for across multiple speech production paradigms localized to the ventral motor...

10.1073/pnas.2404121121 article EN Proceedings of the National Academy of Sciences 2024-12-03

Reconstructing Speech Stimuli From Human Auditory Cortex Activity Using a WaveNet Approach

OPENALEX - Publications

Ran Wang Yao Wang Adeen Flinker

The superior temporal gyrus (STG) region of cortex critically contributes to speech recognition. In this work, we show that a proposed WaveNet, with limited available data, is able reconstruct stimuli from STG intracranial recordings. We further investigate the impulse response fitted model for each recording electrode and observe phoneme level temporospectral tuning properties recorded area cortex. This discovery consistent previous studies implicating posterior (pSTG) in phonetic...

10.1109/spmb.2018.8615605 article EN 2018-12-01

Analysis of Amplitude and Frequency Perturbation in the Voice for Fake Audio Detection

OPENALEX - Publications

Kai Li Yao Wang Le-Minh Nguyen Masato Akagi Masashi Unoki

Fake audio detection (FAD) aims to detect fake speech generated by advanced voice conversion and text-to-speech technologies. Recently, the quality of synthesized has significantly improved due remarkable development deep neural networks. However, it is still easy for humans identify perceiving pathological prosody in a voice. Pathological related amplitude frequency perturbation (AFP) provides essential cues speech. This paper proposed analyze AFP differences using jitter shimmer features....

10.23919/apsipaasc55919.2022.9980028 article EN 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) 2022-11-07

A Decoupling and Aggregating Framework for Joint Extraction of Entities and Relations

OPENALEX - Publications

Yao Wang Xin Liu Wei Kun Kong Hai-Tao Yu Teeradaj Racharak and 2 more

Named Entity Recognition and Relation Extraction are two crucial challenging subtasks in Information Extraction. Despite the successes achieved by traditional approaches, fundamental research questions remain open. First, most recent studies use parameter sharing for a single subtask or shared features both subtasks, ignoring their semantic differences. Second, information interaction mainly focuses on leaving fine-grained among subtask-specific of encoding subjects, relations, objects...

10.1109/access.2024.3420877 article EN cc-by-nc-nd IEEE Access 2024-01-01

Reconstructing Speech Stimuli From Human Auditory Cortex Activity Using a WaveNet Approach

OPENALEX - Publications

Ran Wang Yao Wang Adeen Flinker

The superior temporal gyrus (STG) region of cortex critically contributes to speech recognition. In this work, we show that a proposed WaveNet, with limited available data, is able reconstruct stimuli from STG intracranial recordings. We further investigate the impulse response fitted model for each recording electrode and observe phoneme level temporospectral tuning properties recorded area cortex. This discovery consistent previous studies implicating posterior (pSTG) in phonetic...

10.48550/arxiv.1811.02694 preprint EN other-oa arXiv (Cornell University) 2018-01-01