Skip to Main content Skip to Navigation
Journal articles

Find The Errors, Get The Better: Enhancing Machine Translation via Word Confidence Estimation

Abstract : This article presents two novel ideas of improving the Machine Translation (MT) quality by applying the word-level quality prediction for the second pass of decoding. In this manner, the word scores estimated by Word Con dence Estimation (WCE) systems help to reconsider the MT hypotheses for selecting a better candidate rather than accepting the current sub-optimal one. In the rst attempt, the selection scope is limited to the MT N-best list, in which our proposed re-ranking features are combined with those of the decoder for re-scoring. Then, the search space is enlarged over the entire search graph, storing many more hypotheses generated during the rst pass of decoding. Over all paths containing words of the N-best list, we propose an algorithm to strengthen or weaken them depending on the estimated word quality. In both methods, the highest-score candidate after the search becomes the ocial translation. The results obtained show that both approaches advance the MT quality over the one-pass baseline, and the Search Graph Re-decoding achieves more gains (in BLEU score) than N-best List Re-ranking method.
Document type :
Journal articles
Complete list of metadata

Cited literature [48 references]  Display  Hide  Download
Contributor : Laurent Besacier Connect in order to contact the contributor
Submitted on : Thursday, November 9, 2017 - 3:04:32 PM
Last modification on : Wednesday, November 3, 2021 - 6:46:37 AM
Long-term archiving on: : Saturday, February 10, 2018 - 1:31:58 PM


Files produced by the author(s)


  • HAL Id : hal-01436779, version 1


Ngoc-Quang Luong, Laurent Besacier, Benjamin Lecouteux. Find The Errors, Get The Better: Enhancing Machine Translation via Word Confidence Estimation. Natural Language Engineering, Cambridge University Press (CUP), 2017, 1, pp.1 - 24. ⟨hal-01436779⟩



Record views


Files downloads