Improving sampling-based alignment by investigating the distribution of n-grams in phrase translation tables
Résumé
This paper describes an approach to improve the performance of sampling-based multilingual alignment on translation tasks by investigating the distribution of n-grams in the translation tables. This approach consists in enforcing the alignment of n-grams. The quality of phrase translation tables output by this approach and that of MGIZA++ is compared in statistical machine translation tasks. Significant improvements for this approach are reported. In addition, merging translation tables is shown to outperform state-of-the-art techniques.
Origine : Fichiers produits par l'(les) auteur(s)
Loading...