D. Bahdanau, K. Cho, and Y. Bengio, Neural machine translation by jointly learning to align and translate, Proceedings of the International Conference on Learning Representations, 2015.

S. Bengio, O. Vinyals, N. Jaitly, and N. Shazeer, Scheduled sampling for sequence prediction with recurrent neural networks, Proceedings of the 28th International Conference on Neural Information Processing Systems, vol.1, pp.1171-1179, 2015.

O. Caglayan, A. Bardet, F. Bougares, L. Barrault, K. Wang et al., LIUM-CVC submissions for WMT18 multimodal translation task, Proceedings of the Third Conference on Machine Translation: Shared Task Papers, Brussels, Belgium. Association for Computational Linguistics, vol.2, 2018.

O. Caglayan, L. Barrault, and F. Bougares, Multimodal attention for neural machine translation, 2016.

O. Caglayan, M. García-Martínez, A. Bardet, W. Aransa et al., NMTPY: A Flexible Toolkit for Advanced Neural Machine Translation Systems. CoRR, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01647873

J. H. Clark, C. Dyer, A. Lavie, and N. A. Smith, Better hypothesis testing for statistical machine translation: Controlling for optimizer instability, 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp.176-181, 2011.

J. Delbrouck and S. Dupont, UMONS submission for WMT18 multimodal translation task, Proceedings of the Third Conference on Machine Translation, Brussels, Belgium. Association for Computational Linguistics, 2018.

M. Denkowski and A. Lavie, Meteor universal: Language specific translation evaluation for any target language, EACL 2014 Workshop on Statistical Machine Translation, 2014.

C. Dyer, V. Chahuneau, and N. Smith, A simple, fast, and effective reparameterization of IBM model 2, Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2013), pp.644-649, 2013.

D. Elliott, S. Frank, L. Barrault, F. Bougares, and L. Specia, Findings of the second shared task on multimodal machine translation and multilingual image description, Proceedings of the Second Conference on Machine Translation, vol.2, pp.215-233, 2017.

D. Elliott, S. Frank, and E. Hasler, Multi-language image description with neural sequence models, 2015.

D. Elliott, S. Frank, K. Sima'an, and L. Specia, Multi30K: Multilingual English-German image descriptions, 5th Workshop on Vision and Language, pp.70-74, 2016.

D. Elliott and Á. Kádár, Imagination improves Multimodal Translation, Proceedings of the Eighth International Joint Conference on Natural Language Processing, vol.1, pp.130-141, 2017.

C. Federmann, Appraise: An open-source toolkit for manual evaluation of machine translation output, The Prague Bulletin of Mathematical Linguistics, vol.98, pp.25-35, 2012.

R. Girshick, I. Radosavovic, G. Gkioxari, P. Dollár, and K. He, , 2018.

Y. Graham, T. Baldwin, A. Moffat, and J. Zobel, Can machine translation systems be evaluated by the crowd alone, Natural Language Engineering, vol.23, issue.1, pp.3-30, 2015.

S. Grönroos, B. Huet, M. Kurimo, J. Laaksonen, B. Merialdo et al., The MeMAD submission to the WMT18 multimodal translation task, Proceedings of the Third Conference on Machine Translation, 2018.

J. Gwinnup, J. Sandvick, M. Hutt, G. Erdmann, J. Duselis et al., The AFRL-Ohio State WMT18 multimodal system: Combining visual with traditional, Proceedings of the Third Conference on Machine Translation, Brussels, Belgium. Association for Computational Linguistics, 2018.

K. He, X. Zhang, S. Ren, and J. Sun, Deep residual learning for image recognition, IEEE Conference on Computer Vision and Pattern Recognition, pp.770-778, 2016.

J. Helcl, J. Libovický, and D. Variš, CUNI system for the WMT18 multimodal translation tasks, Proceedings of the Third Conference on Machine Translation, 2018.

J. Hitschler, S. Schamoni, and S. Riezler, Multimodal Pivots for Image Caption Translation, 54th Annual Meeting of the Association for Computational Linguistics, pp.2399-2409, 2016.

M. Junczys-Dowmunt, R. Grundkiewicz, T. Dwojak, H. Hoang, K. Heafield et al., Marian: Fast neural machine translation in C++, Proceedings of ACL 2018, System Demonstrations, pp.116-121, 2018.

P. Koehn, H. Hoang, A. Birch, C. Callison-Burch, M. Federico et al., Moses: Open source toolkit for statistical machine translation, 45th Annual Meeting of the Association for Computational Linguistics, pp.177-180, 2007.

C. Lala, P. Madhyastha, C. Scarton, and L. Specia, Sheffield submissions for WMT18 multimodal translation shared task, Proceedings of the Third Conference on Machine Translation, Brussels, Belgium. Association for Computational Linguistics, 2018.

C. Lala and L. Specia, Multimodal Lexical Translation, Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), 2018.

T. Lin, M. Maire, S. J. Belongie, L. D. Bourdev, R. B. Girshick et al., Microsoft COCO: common objects in context. CoRR, 2014.

P. Lison and J. Tiedemann, OpenSubtitles2016: Extracting large parallel corpora from movie and TV subtitles, Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), 2016.

K. Papineni, S. Roukos, T. Ward, and W. Zhu, Bleu: A method for automatic evaluation of machine translation, 40th Annual Meeting on Association for Computational Linguistics, pp.311-318, 2002.

A. Raganato, C. D. Bovi, and R. Navigli, Neural sequence learning models for word sense disambiguation, Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp.1156-1167, 2017.

S. J. Rennie, E. Marcheret, Y. Mroueh, J. Ross, and V. Goel, Self-critical sequence training for image captioning, IEEE Conference on Computer Vision and Pattern Recognition, pp.1179-1195, 2017.

M. Riedl and C. Biemann, Unsupervised compound splitting with distributional semantics rivals supervised methods, Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp.617-622, 2016.

M. Snover, B. Dorr, R. Schwartz, L. Micciulla, and J. Makhoul, A study of translation edit rate with targeted human annotation, Proceedings of Association for Machine Translation in the Americas, 2006.

L. Specia, S. Frank, K. Sima'an, and D. Elliott, A shared task on multimodal machine translation and crosslingual image description, First Conference on Machine Translation, pp.543-553, 2016.

J. Straková, M. Straka, and J. Hajič, Open-Source Tools for Morphology, Lemmatization, POS Tagging and Named Entity Recognition, Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pp.13-18, 2014.

A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones et al., Attention is all you need, Advances in Neural Information Processing Systems, pp.5998-6008, 2017.

P. Young, A. Lai, M. Hodosh, and J. Hockenmaier, From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions, Transactions of the Association for Computational Linguistics, vol.2, pp.67-78, 2014.

D. Yuan, J. Richardson, R. Doherty, C. Evans, and E. Altendorf, Semi-supervised word sense disambiguation with neural models, 2016.

R. Zheng, Y. Yang, M. Ma, and L. Huang, Ensemble sequence level training for multimodal MT: OSU-Baidu WMT18 multimodal machine translation system report, Proceedings of the Third Conference on Machine Translation, Brussels, Belgium. Association for Computational Linguistics, 2018.