W. Aransa, H. Schwenk, and L. Barrault, Improving continuous space language models using auxiliary features, Proceedings of the 12th International Workshop on Spoken Language Translation, pp.151-158, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01454941

D. Bahdanau, K. Cho, and Y. Bengio, Neural machine translation by jointly learning to align and translate, 2014.

O. Bojar, R. Chatterjee, C. Federmann, B. Haddow, M. Huck et al., Findings of the 2015 Workshop on Statistical Machine Translation, Proceedings of the Tenth Workshop on Statistical Machine Translation, Association for Computational Linguistics, pp.1-46, 2015.

J. Chung, Ç. Gülçehre, K. Cho, and Y. Bengio, Empirical evaluation of gated recurrent neural networks on sequence modeling, 2014.

D. Elliott, S. Frank, and E. Hasler, Multi-language image description with neural sequence models, 2015.

O. Firat, K. Cho, and Y. Bengio, Multi-way, multilingual neural machine translation with a shared attention mechanism. arXiv preprint, 2016.
DOI : 10.18653/v1/n16-1101
URL : http://arxiv.org/pdf/1601.01073

X. Glorot and Y. Bengio, Understanding the difficulty of training deep feedforward neural networks, Proceedings of the International Conference on Artificial Intelligence and Statistics (AISTATS'10). Society for Artificial Intelligence and Statistics, 2010.

K. He, X. Zhang, S. Ren, and J. Sun, Deep residual learning for image recognition. arXiv preprint, 2015.
DOI : 10.1109/cvpr.2016.90
URL : http://arxiv.org/pdf/1512.03385

K. Heafield, KenLM: faster and smaller language model queries, Proceedings of the EMNLP 2011 Sixth Workshop on Statistical Machine Translation, pp.187-197, 2011.

N. Kalchbrenner and P. Blunsom, Recurrent continuous translation models, 2013.

R. Kiros, R. Salakhutdinov, and R. Zemel, Multimodal neural language models, Proceedings of the 31st International Conference on Machine Learning (ICML-14) Conference Proceedings, pp.595-603, 2014.

R. Kiros, R. Salakhutdinov, and R. S. Zemel, Unifying visual-semantic embeddings with multimodal neural language models, 2014.

P. Koehn, H. Hoang, A. Birch, C. Callison-Burch, M. Federico et al., Moses: Open source toolkit for statistical machine translation, Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions, ACL '07, pp.177-180, 2007.
DOI : 10.3115/1557769.1557821

A. Krizhevsky, I. Sutskever, and G. E. Hinton, ImageNet classification with deep convolutional neural networks, Advances in neural information processing systems, pp.1097-1105, 2012.
URL : http://dl.acm.org/ft_gateway.cfm?id=3065386&type=pdf

A. Lavie and A. Agarwal, METEOR: An automatic metric for MT evaluation with high levels of correlation with human judgments, Proceedings of the Second Workshop on Statistical Machine Translation, StatMT '07, pp.228-231, 2007.
DOI : 10.3115/1626355.1626389

Q. V. Le and T. Mikolov, Distributed representations of sentences and documents, 2014.

T. Mikolov, M. Karafiát, L. Burget, J. Černocký, and S. Khudanpur, Recurrent neural network based language model, INTERSPEECH, vol.2, p.3, 2010.

F. Och and H. Ney, A Systematic Comparison of Various Statistical Alignment Models, Computational Linguistics, vol.29, issue.1, pp.19-51, 2003.
DOI : 10.1162/089120103321337421

F. Och, Minimum error rate training in statistical machine translation, Proceedings of the 41st Annual Meeting on Association for Computational Linguistics, ACL '03, pp.160-167, 2003.
DOI : 10.3115/1075096.1075117
URL : http://acl.ldc.upenn.edu/acl2003/main/ps/Och.ps

K. Papineni, S. Roukos, T. Ward, and W. Zhu, BLEU: a method for automatic evaluation of machine translation, Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, ACL '02, pp.311-318, 2002.
DOI : 10.3115/1073083.1073135

H. Schwenk, Continuous space language models for statistical machine translation, The Prague Bulletin of Mathematical Linguistics, pp.137-146, 2010.
DOI : 10.3115/1273073.1273166
URL : https://hal.archives-ouvertes.fr/hal-01433882

R. Sennrich and B. Haddow, A Joint Dependency Model of Morphological and Syntactic Structure for Statistical Machine Translation, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp.114-121, 2015.
DOI : 10.18653/v1/D15-1248

K. Simonyan and A. Zisserman, Very deep convolutional networks for large-scale image recognition, 2014.

A. Stolcke, SRILM - an extensible language modeling toolkit, Proceedings of the 7th International Conference on Spoken Language Processing, pp.901-904, 2002.

I. Sutskever, O. Vinyals, and Q. V. Le, Sequence to sequence learning with neural networks, 2014.

J. Xiao, J. Hays, K. A. Ehinger, A. Oliva, and A. Torralba, SUN database: Large-scale scene recognition from abbey to zoo, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.3485-3492, 2010.
DOI : 10.1109/CVPR.2010.5539970
URL : http://cs.brown.edu/~hays/papers/sun.pdf

K. Xu, J. Ba, R. Kiros, K. Cho, A. Courville, R. Salakhutdinov, R. Zemel, and Y. Bengio, Show, attend and tell: Neural image caption generation with visual attention, Proceedings of the 32nd International Conference on Machine Learning, pp.2048-2057, 2015.