Spoken language translation graphs re-decoding using automatic quality assessment - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2015

Spoken language translation graphs re-decoding using automatic quality assessment

Résumé

This paper investigates how automatic quality assessment of spoken language translation (SLT), also named confidence estimation (CE), can help re-decoding SLT output graphs and improve the overall speech translation performance. Our graph redecoding method can be seen as a second-pass of translation. For this, a robust word confidence estimator for SLT is required. We propose several estimators based on our estimation of transcription (ASR) quality, translation (MT) quality, or both (combined ASR+MT). Using these word confidence measures to re-decode the spoken language translation graph leads to a significant BLEU improvement (more than 2 points) compared to our SLT baseline, for a French-English SLT task. These results could be applied to interactive speech translation or computer-assisted translation of speeches and lectures.
Fichier principal
Vignette du fichier
ASRU.pdf (188.86 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01289158 , version 1 (29-01-2018)

Identifiants

Citer

Laurent Besacier, Benjamin Lecouteux, Ngoc-Quang Luong, Ngoc-Tien Le. Spoken language translation graphs re-decoding using automatic quality assessment. IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), Dec 2015, Scotsdale, AZ, United States. ⟨10.1109/ASRU.2015.7404804⟩. ⟨hal-01289158⟩
75 Consultations
204 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More