Limitations of MT Quality Estimation Supervised Systems: The Tails Prediction Problem - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2014

Limitations of MT Quality Estimation Supervised Systems: The Tails Prediction Problem

Résumé

In this paper we address the question of the reliability of the predictions made by MT Quality Estimation (QE) systems. In particular, we show that standard supervised QE systems, usually trained to minimize MAE, make serious mistakes at predicting the quality of the sentences in the tails of the quality range. We describe the problem and propose several experiments to clarify their causes and effects. We use the WMT12 and WMT13 QE Shared Task datasets to prove that our claims hold in general and are not specific to a dataset or a system.
Fichier principal
Vignette du fichier
coling2014.pdf (1.3 Mo) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte

Dates et versions

hal-01066394 , version 1 (19-09-2014)

Identifiants

  • HAL Id : hal-01066394 , version 1

Citer

Erwan Moreau, Carl Vogel. Limitations of MT Quality Estimation Supervised Systems: The Tails Prediction Problem. Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics, Aug 2014, Dublin, Ireland. pp.2205--2216. ⟨hal-01066394⟩
63 Consultations
96 Téléchargements

Partager

Gmail Facebook X LinkedIn More