
End-to-End Speech Translation with the Transformer

Speech translation, End-to-end principle

DOI: 10.21437/iberspeech.2018-13

Publication Date: 2018-11-19T13:02:30Z


AUTHORS (4)

Laura Cros Vila

Carlos Escolano

José A. R. Fonollosa

Marta R. Costa-Jussà

ABSTRACT

Speech Translation has traditionally been addressed by concatenating two tasks: Speech Recognition and Machine Translation. The main drawback of this approach is that errors from the first stage carry over into the second. Recently, neural approaches to Speech Recognition and Machine Translation have made it possible to address the task with an End-to-End Speech Translation architecture. In this paper, we propose using the Transformer architecture, which is based solely on attention mechanisms, to build the End-to-End Speech Translation system. As a contrastive architecture, we use the same Transformer to build the Speech Recognition and Machine Translation systems and perform Speech Translation by concatenating them. Results on a standard Spanish-to-English task show that the end-to-end architecture outperforms the concatenated systems by half a BLEU point. Peer Reviewed
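The contrast drawn in the abstract between a concatenated (cascaded) system and an end-to-end one can be illustrated with a minimal sketch. Everything in it is a hypothetical stand-in and not the paper's models: `toy_asr`, `toy_mt`, `toy_end_to_end`, the `Audio` type, and the tiny lexicon are assumptions chosen only to show how a recognition error in the first stage reaches the translation stage unchanged.

```python
from typing import Callable, List

# Hypothetical stand-in for a waveform or acoustic feature sequence.
Audio = List[float]


def cascade(asr: Callable[[Audio], str],
            mt: Callable[[str], str]) -> Callable[[Audio], str]:
    """Build a cascaded system: source audio -> source text -> target text."""
    def translate(audio: Audio) -> str:
        transcript = asr(audio)   # any recognition error is fixed at this point
        return mt(transcript)     # MT only ever sees the transcript, not the audio
    return translate


def toy_asr(audio: Audio) -> str:
    """Toy recognizer (assumption): 'noisy' input yields a misrecognized word."""
    return "ola mundo" if sum(audio) < 0 else "hola mundo"


def toy_mt(text: str) -> str:
    """Toy word-by-word translator (assumption) with a two-entry lexicon."""
    lexicon = {"hola": "hello", "mundo": "world"}
    return " ".join(lexicon.get(word, "<unk>") for word in text.split())


def toy_end_to_end(audio: Audio) -> str:
    """Toy end-to-end system (assumption): maps audio to target text directly,
    with no intermediate transcript that could be misrecognized."""
    return "hello world"


cascaded = cascade(toy_asr, toy_mt)
clean_audio: Audio = [0.1, 0.2]
noisy_audio: Audio = [-1.0]

print(cascaded(clean_audio))        # cascade on clean input
print(cascaded(noisy_audio))        # ASR error carried into the translation
print(toy_end_to_end(noisy_audio))  # single model, no intermediate text
```

The sketch says nothing about which approach scores better; it only shows the structural difference. In the cascade the translation stage cannot recover from a wrong transcript, while an end-to-end model has no such intermediate step.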


CITATIONS (13)
