A. Allauzen, J. M. Crego, F. Ilknur-durgar-el-kahlout, and . Yvon, LIMSI's statistical translation systems for WMT'10, Proc. of the Joint Workshop on Statistical Machine Translation and MetricsMATR, pp.54-59, 2010.

Y. Bengio, R. Ducharme, P. Vincent, and C. Janvin, Neural Probabilistic Language Models, JMLR, vol.3, pp.1137-1155, 2003.
DOI : 10.1007/3-540-33486-6_6
URL : https://hal.archives-ouvertes.fr/hal-01434258

F. Casacuberta and E. Vidal, Machine Translation with Inferred Stochastic Finite-State Transducers, Computational Linguistics, vol.23, issue.2, pp.205-225, 2004.
DOI : 10.1109/34.211465
URL : http://doi.org/10.1162/089120104323093294

F. Stanley, J. T. Chen, and . Goodman, An empirical study of smoothing techniques for language modeling, 1998.

M. Collins, P. Koehn, and I. Kucerova, Clause restructuring for statistical machine translation, Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics , ACL '05, pp.531-540, 2005.
DOI : 10.3115/1219840.1219906
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.108.7612

M. Josep, J. B. Crego, and . Mariño, Improving statistical MT by coupling reordering and decoding. Machine Translation, pp.199-215, 2006.

M. Josep, F. Crego, J. B. Yvon, and . Mariño, N-code: an open-source Bilingual N-gram SMT Toolkit, Prague Bulletin of Mathematical Linguistics, vol.96, pp.49-58, 2011.

D. Déchelotte, G. Adda, A. Allauzen, O. Galibert, J. Gauvain et al., LIMSI's statistical translation systems for WMT'08, Proceedings of the Third Workshop on Statistical Machine Translation, StatMT '08, 2008.
DOI : 10.3115/1626394.1626404

A. Deoras, T. Mikolov, S. Kombrink, and K. Church, Approximate inference: A sampling based modeling technique to capture complex dependencies in a language model, Speech Communication, vol.55, issue.1, pp.162-177, 2013.
DOI : 10.1016/j.specom.2012.08.004

F. Ilknur-durgar-el-kahlout and . Yvon, The pay-offs of preprocessing for German-English Statistical Machine Translation, Proceedings of the seventh International Workshop on Spoken Language Translation (IWSLT), pp.251-258, 2010.

R. Kneser and H. Ney, Improved backing-off for M-gram language modeling, 1995 International Conference on Acoustics, Speech, and Signal Processing, pp.181-184, 1995.
DOI : 10.1109/ICASSP.1995.479394

P. Koehn and H. Hoang, Factored translation models, Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), pp.868-876, 2007.

H. Le, I. Oparin, A. Allauzen, J. Gauvain, and F. Yvon, Structured Output Layer neural network language model, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.5524-5527, 2011.
DOI : 10.1109/ICASSP.2011.5947610

H. Le, A. Allauzen, and F. Yvon, Continuous space translation models with neural networks, NAACL '12: Proceedings of the 2012 Conference of the North American Chapter, 2012.

H. Le, A. Allauzen, and F. Yvon, Measuring the influence of long range dependencies with neural network language models, Proceedings of the NAACL-HLT 2012 Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT, pp.1-10, 2012.

G. Lembersky, N. Ordan, and S. Wintner, Language Models for Machine Translation: Original vs. Translated Texts, Computational Linguistics, vol.11, issue.2, pp.799-825, 2012.
DOI : 10.1075/btl.4
URL : http://doi.org/10.1162/coli_a_00111

B. José, R. E. Mariño, J. M. Banchs, . Crego, P. De-gispert et al., N-grambased machine translation, Computational Linguistics, vol.32, issue.4, pp.527-549, 2006.

C. Robert and . Moore, Fast and accurate sentence alignment of bilingual corpora of the Association for Machine Translation in the Americas on Machine Translation: From Research to Real Users, Proceedings of the 5th Conference, pp.135-144, 2002.

G. Neubig, T. Watanabe, and S. Mori, Inducing a discriminative parser to optimize machine translation reordering, Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp.843-853, 2012.

F. Och, Minimum error rate training in statistical machine translation, Proceedings of the 41st Annual Meeting on Association for Computational Linguistics , ACL '03, pp.160-167, 2003.
DOI : 10.3115/1075096.1075117

L. Padró and E. Stanilovsky, Freeling 3.0: Towards wider multilinguality, Proceedings of the Language Resources and Evaluation Conference, 2012.

K. Papineni, S. Roukos, T. Ward, and W. Zhu, BLEU, Proceedings of the 40th Annual Meeting on Association for Computational Linguistics , ACL '02, pp.311-318, 2002.
DOI : 10.3115/1073083.1073135

H. Schmid and F. Laws, Estimation of conditional probabilities with decision trees and an application to fine-grained POS tagging, Proceedings of the 22nd International Conference on Computational Linguistics, COLING '08, pp.777-784, 2008.
DOI : 10.3115/1599081.1599179

H. Schmid, Probabilistic part-of-speech tagging using decision trees, Proc. of International Conference on New Methods in Language Processing, pp.44-49, 1994.

H. Schwenk, D. Déchelotte, and J. Gauvain, Continuous-Space Language Models for Statistical Machine Translation, Proc. COL- ING/ACL'06, pp.723-730, 2006.
DOI : 10.2478/v10108-010-0014-6
URL : https://hal.archives-ouvertes.fr/hal-01433882

I. Sutskever, J. Martens, and G. Hinton, Generating text with recurrent neural networks, Proceedings of the 28th International Conference on Machine Learning (ICML-11), ICML '11, pp.1017-1024, 2011.

C. Tillmann, A unigram orientation model for statistical machine translation, Proceedings of HLT-NAACL 2004: Short Papers on XX, HLT-NAACL '04, pp.101-104, 2004.
DOI : 10.3115/1613984.1614010