Conference paper, Year: 2009

Translation of sublanguages by subgrammars

Abstract

This paper compares the performance of two data-driven translation methods on a very constrained sublanguage: dates. As a first result, we show that a statistical method outperforms an example-based method for the translation of dates from Chinese into English when small random training corpora are used; with 750 random examples, both methods translate a corpus of 4,018 dates almost perfectly. As a second result, we prove that 58 dates theoretically suffice to translate the same corpus of 4,018 dates perfectly, and we verify this experimentally with an example-based method, whereas a statistical method fails to translate 345 of the 4,018 dates.
Main file: ebmt3-translation-sublanguages.pdf (375.13 KB)
Origin: files produced by the author(s)

Dates and versions

hal-00439890, version 1 (08-12-2009)

Identifiers

  • HAL Id: hal-00439890, version 1

Cite

Julien Gosme, Yves Lepage, Adrien Lardilleux. Translation of sublanguages by subgrammars. 3rd International Workshop on Example-Based Machine Translation, Nov 2009, Dublin, Ireland. pp. 77-84. ⟨hal-00439890⟩