The University of Edinburgh-Uppsala University’s Submission to the WMT 2020 Chat Translation Task - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

The University of Edinburgh-Uppsala University’s Submission to the WMT 2020 Chat Translation Task

Nikita Moghe
  • Fonction : Auteur
  • PersonId : 1080204
Christian Hardmeier
  • Fonction : Auteur
  • PersonId : 1080205
Rachel Bawden

Résumé

This paper describes the joint submission of the University of Edinburgh and Uppsala University to the WMT'20 chat translation task for both language directions (English↔German). We use existing state-of-the-art machine translation models trained on news data and fine-tune them on in-domain and pseudo-in-domain web crawled data. We also experiment with (i) adaptation using speaker and domain tags and (ii) using different types and amounts of preceding context. We observe that contrarily to expectations, exploiting context degrades the results (and on analysis the data is not highly contextual). However using domain tags does improve scores according to the automatic evaluation. Our final primary systems use domain tags and are ensembles of 4 models, with noisy channel reranking of outputs. Our en-de system was ranked second in the shared task while our de-en system outperformed all the other systems.
Fichier principal
Vignette du fichier
WMT_2020_Chat.pdf (162.48 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02981159 , version 1 (27-10-2020)

Identifiants

  • HAL Id : hal-02981159 , version 1

Citer

Nikita Moghe, Christian Hardmeier, Rachel Bawden. The University of Edinburgh-Uppsala University’s Submission to the WMT 2020 Chat Translation Task. 5th Conference on Machine Translation, Nov 2020, Online, Unknown Region. ⟨hal-02981159⟩
131 Consultations
84 Téléchargements

Partager

Gmail Facebook X LinkedIn More