Spoken language translation graphs re-decoding using automatic quality assessment

Abstract : This paper investigates how automatic quality assessment of spoken language translation (SLT), also named confidence estimation (CE), can help re-decoding SLT output graphs and improve the overall speech translation performance. Our graph redecoding method can be seen as a second-pass of translation. For this, a robust word confidence estimator for SLT is required. We propose several estimators based on our estimation of transcription (ASR) quality, translation (MT) quality, or both (combined ASR+MT). Using these word confidence measures to re-decode the spoken language translation graph leads to a significant BLEU improvement (more than 2 points) compared to our SLT baseline, for a French-English SLT task. These results could be applied to interactive speech translation or computer-assisted translation of speeches and lectures.
Document type :
Conference papers
Liste complète des métadonnées

Cited literature [32 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01289158
Contributor : Benjamin Lecouteux <>
Submitted on : Monday, January 29, 2018 - 11:00:48 AM
Last modification on : Tuesday, February 12, 2019 - 1:31:31 AM
Document(s) archivé(s) le : Friday, May 25, 2018 - 11:10:48 AM

File

ASRU.pdf
Files produced by the author(s)

Identifiers

Collections

Citation

Laurent Besacier, Benjamin Lecouteux, Ngoc-Quang Luong, Ngoc-Tien Le. Spoken language translation graphs re-decoding using automatic quality assessment. IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), Dec 2015, Scotsdale, AZ, United States. ⟨10.1109/ASRU.2015.7404804⟩. ⟨hal-01289158⟩

Share

Metrics

Record views

167

Files downloads

58