Skip to Main content Skip to Navigation
Conference papers

End-to-End Automatic Speech Translation of Audiobooks

Abstract : We investigate end-to-end speech-to-text translation on a corpus of audiobooks specifically augmented for this task. Previous works investigated the extreme case where source language transcription is not available during learning nor decoding , but we also study a midway case where source language transcription is available at training time only. In this case, a single model is trained to decode source speech into target text in a single pass. Experimental results show that it is possible to train compact and efficient end-to-end speech translation models in this setup. We also distribute the corpus and hope that our speech translation baseline on this corpus will be challenged in the future.
Document type :
Conference papers
Complete list of metadata

Cited literature [16 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01709586
Contributor : Laurent Besacier Connect in order to contact the contributor
Submitted on : Thursday, February 15, 2018 - 10:20:33 AM
Last modification on : Tuesday, May 11, 2021 - 11:37:20 AM
Long-term archiving on: : Sunday, May 6, 2018 - 4:30:10 AM

File

main.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01709586, version 1

Citation

Alexandre Bérard, Laurent Besacier, Ali Can Kocabiyikoglu, Olivier Pietquin. End-to-End Automatic Speech Translation of Audiobooks. ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Alberta, Canada. ⟨hal-01709586⟩

Share

Metrics

Record views

601

Files downloads

1356