Voice conversion based on static speaker characteristics
Speaker identity
Statistics
Speech recognition
Dynamic programming
01 natural sciences
410
Speech synthesis
400
Pitch extraction
Speech processing
0103 physical sciences
Speech analysis
Algorithms
Voice conversion
DOI: 10.1109/comsig.1998.736922
Publication Date: 2002-11-27T18:21:56Z
AUTHORS (2)
ABSTRACT
Voice conversion has recently emerged as an interesting branch of speech processing that deals with the modification of a speaker's perceived identity. The technology has applications in speech recognition and in the entertainment and security industries. This paper provides a brief introduction to current voice conversion approaches and discusses the development of the PASS system, a parametric voice conversion algorithm based on static speaker characteristics. The system is easy to implement, requires no phonetic transcription of the speech data, and is shown to be valuable when very little training data is available. Particular mention is made of the pitch extraction subsystem, which uses a novel pitch determination algorithm to ensure robust estimation of pitch statistics.
REFERENCES (31)
CITATIONS (2)