Conference papers

Does Multimodality Help Human and Machine for Translation and Image Captioning?

Abstract : This paper presents the systems developed by LIUM and CVC for the WMT16 Multimodal Machine Translation challenge. We explored various comparative methods, namely phrase-based systems and attentional recurrent neural network models trained on monomodal or multimodal data. We also performed a human evaluation to estimate the usefulness of multimodal data for human machine translation and image description generation. Our systems obtained the best results for both tasks according to the automatic evaluation metrics BLEU and METEOR.

https://hal.archives-ouvertes.fr/hal-01433183
Contributor : Sylvain Meignier
Submitted on : Wednesday, January 17, 2018 - 12:54:42 PM
Last modification on : Friday, April 26, 2019 - 1:54:02 PM
Document(s) archived on : Monday, May 7, 2018 - 4:57:19 PM

File

wmt16_multimodal_LIUMCVC.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01433183, version 1

Citation

Ozan Caglayan, Walid Aransa, Yaxing Wang, Marc Masana, Mercedes Garcia-Martinez, et al. Does Multimodality Help Human and Machine for Translation and Image Captioning? First Conference on Machine Translation, Aug 2016, Berlin, Germany. pp.627-633. ⟨hal-01433183⟩
