Lexical Micro-adaptation for Neural Machine Translation - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2019

Lexical Micro-adaptation for Neural Machine Translation

Jitao Xu
  • Fonction : Auteur
  • PersonId : 184998
  • IdHAL : xujitao
Josep Crego
  • Fonction : Auteur
Jean Senellart
  • Fonction : Auteur
  • PersonId : 1038145

Résumé

This work is inspired by a typical machine translation industry scenario in which translators make use of in-domain data for facilitating translation of similar or repeating sentences. We introduce a generic framework applied at inference in which a subset of segment pairs are first extracted from training data according to their similarity to the input sentences. These segments are then used to dynamically update the parameters of a generic NMT network, thus performing a lexical micro-adaptation. Our approach demonstrates strong adaptation performance to new and existing datasets including pseudo in-domain data. We evaluate our approach on a heterogeneous English-French training dataset showing accuracy gains on all evaluated domains when compared to strong adaptation baselines.
Fichier principal
Vignette du fichier
IWSLT2019_paper_9.pdf (422.2 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02635039 , version 1 (27-05-2020)

Identifiants

  • HAL Id : hal-02635039 , version 1

Citer

Jitao Xu, Josep Crego, Jean Senellart. Lexical Micro-adaptation for Neural Machine Translation. International Workshop on Spoken Language Translation, Nov 2019, Hong Kong, Hong Kong SAR China. ⟨hal-02635039⟩
101 Consultations
66 Téléchargements

Partager

Gmail Facebook X LinkedIn More