JSUT and JVS: Free Japanese voice corpora for accelerating speech synthesis research
0202 electrical engineering, electronic engineering, information engineering
02 engineering and technology
DOI: 10.1250/ast.41.761
Publication Date: 2020-08-31T22:07:13Z
AUTHORS (7)
ABSTRACT
In this paper, we develop two corpora for speech synthesis research. Thanks to improvements in machine learning techniques, including deep learning, speech synthesis is becoming a machine learning task. To accelerate speech synthesis research, we aim at developing Japanese voice corpora reasonably accessible from not only academic institutions but also commercial companies. In this paper, we construct the JSUT and JVS corpora. They are designed mainly for text-to-speech synthesis and voice conversion, respectively. The JSUT corpus contains 10 hours of reading-style speech uttered by a single speaker, and the JVS corpus contains 30 hours of speech containing three styles of speech uttered by 100 speakers. This paper describes how we designed and constructed the corpora and summarizes their specifications. The corpora are available at our project pages.
REFERENCES (50)
CITATIONS (20)