Exploring N-grams Distribution for Sampling-based Alignment - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2011

Exploring N-grams Distribution for Sampling-based Alignment

Résumé

We describe an approach to improve the performance of sampling-based multilingual alignment on translation tasks by investigating the distribution of n-grams in the translation tables. This approach consists in enforcing the alignment of n-grams. We compare the quality of phrase translation tables output by this approach and that of MGIZA++ in statistical machine translation tasks. We report significant improvements for this approach and show that merging translation tables outperforms state-of-the-art techniques.
Fichier principal
Vignette du fichier
ltc2011_luo.pdf (69.64 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00650843 , version 1 (12-12-2011)

Identifiants

  • HAL Id : hal-00650843 , version 1

Citer

Juan Luo, Adrien Lardilleux, Yves Lepage. Exploring N-grams Distribution for Sampling-based Alignment. 5th Language & Technology Conference (LTC'11), Nov 2011, Poznań, Poland. pp.5. ⟨hal-00650843⟩
51 Consultations
183 Téléchargements

Partager

Gmail Facebook X LinkedIn More