JSUT and JVS: Free Japanese voice corpora for accelerating speech synthesis research

0202 electrical engineering, electronic engineering, information engineering 02 engineering and technology

DOI: 10.1250/ast.41.761
Publication Date: 2020-08-31T22:07:13Z

AUTHORS (7)

Shinnosuke Takamichi

Ryosuke Sonobe

Kentaro Mitsui

Yuki Saito

Tomoki Koriyama

Naoko Tanji

Hiroshi Saruwatari

ABSTRACT

In this paper, we develop two Japanese voice corpora for speech synthesis research. Thanks to improvements in machine learning techniques, including deep learning, speech synthesis is becoming a machine learning task. To accelerate speech synthesis research, we aim at developing Japanese voice corpora reasonably accessible from not only academic institutions but also commercial companies. In this paper, we construct the JSUT and JVS corpora. They are designed mainly for text-to-speech synthesis and voice conversion, respectively. The JSUT corpus contains 10 hours of reading-style speech uttered by a single speaker, and the JVS corpus contains 30 hours of speech containing three styles uttered by 100 speakers. This paper describes how we designed and constructed the corpora and summarizes their specifications. The corpora are available at our project pages.

REFERENCES (50)

CITATIONS (20)
