Exploring N-grams Distribution for Sampling-based Alignment
Résumé
We describe an approach to improve the performance of sampling-based multilingual alignment on translation tasks by investigating the distribution of n-grams in the translation tables. This approach consists in enforcing the alignment of n-grams. We compare the quality of phrase translation tables output by this approach and that of MGIZA++ in statistical machine translation tasks. We report significant improvements for this approach and show that merging translation tables outperforms state-of-the-art techniques.
Origine : Fichiers produits par l'(les) auteur(s)
Loading...