D. Bahdanau, K. Cho, and Y. Bengio, Neural machine translation by jointly learning to align and translate, 2014.

S. Bengio and G. Heigold, Word embeddings for speech recognition, Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech'14), 2014.

T. Bluche, J. Louradour, and R. Messina, Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTM Attention, Proceedings of the 14th International Conference on Document Analysis and Recognition (ICDAR'17), 2017.

P. Bojanowski, E. Grave, A. Joulin, and T. Mikolov, Enriching word vectors with subword information, Transactions of the Association of Computational Linguistics, vol.5, issue.1, pp.135-146, 2017.

M. Bollmann, J. Bingel, and A. Søgaard, Learning attention for historical text normalization by learning to pronounce, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL'17), pp.332-344, 2017.

K. Cho, B. Van-merrienboer, D. Bahdanau, and Y. Bengio, On the Properties of Neural Machine Translation: Encoder-Decoder Approaches, Proceedings of the 8th Workshop on Syntax, Semantics and Structure in Statistical Translation (SSST'14), pp.103-111, 2014.

F. Cloppet, V. Eglin, D. Van-cuong-kieu, N. Stutzmann, and . Vincent, ICFHR2016 Competition on Classification of Medieval Handwritings in Latin Script, Proceedings of the 15th International Conference on Frontiers in Handwriting Recognition (ICFHR'16), pp.590-595, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01403775

A. Fischer, A. Keller, V. Frinken, and H. Bunke, Lexicon-free handwritten word spotting using character HMMs, PRL, vol.33, issue.7, pp.934-942, 2012.

A. Volkmar-frinken and C. Fischer, Handwriting recognition in historical documents using very large vocabularies, Proceedings of the 2nd International Workshop on Historical Document Imaging and Processing, pp.67-72, 2013.

D. Garrette and H. , An unsupervised model of orthographic variation for historical document transcription, Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HTL'16), pp.467-472, 2016.

E. Granell, E. Chammas, L. Likforman-sulem, C. Martínez-hinarejos, C. Mokbel et al., Transcription of spanish historical handwritten documents with deep neural networks, Journal of Imaging, vol.4, issue.1, p.15, 2018.

A. Granet, B. Hervy, G. Roman-jimenez, M. Hachicha, E. Morin et al., Crowdsourcing-based Annotation of the Accounting Registers of the Italian Comedy, Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), 2018.
URL : https://hal.archives-ouvertes.fr/hal-01819079

A. Granet, E. Morin, H. Mouchre, S. Quiniou, and C. Viard-gaudin, Transfer learning for handwriting recognition on historical documents, Proceedings of the 7th International Conference on Pattern Recognition Applications and Methods ICPRAM, pp.432-439, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01681126

A. Graves, S. Fernández, F. Gomez, and J. Schmidhuber, Connectionist Temporal Classification: Labelling Unsegmented Sequence Data with Recurrent Neural Networks, Proceedings of the 23rd International Conference on Machine Learning (ICML'06), pp.369-376, 2006.

E. Grosicki and H. El-abed, ICDAR 2011-French Handwriting Recognition Competition, Proceedings of the 11th International Conference on Document Analysis and Recognition (ICDAR'11), pp.1459-1463, 2011.

P. Huang, X. He, J. Gao, L. Deng, A. Acero et al., Learning deep structured semantic models for web search using clickthrough data, Proceedings of the 22nd ACM international conference on Conference on information & knowledge management, pp.2333-2338, 2013.

J. Lladós, M. Rusiñol, A. Fornés, D. Fernández, and A. Dutta, On the influence of word representations for handwritten word spotting in historical documents, IJPRAI, vol.26, issue.05, pp.1263002-1263003, 2012.

V. Nair and G. E. Hinton, Rectified Linear Units Improve Restricted Boltzmann Machines, Proceedings of the 27th international conference on machine learning (ICML'10), pp.807-814, 2010.

H. Nakayama and N. Nishida, Zero-resource machine translation by multimodal encoder-decoder network with multimedia pivot. Machine Translation, vol.31, pp.49-64, 2017.

Q. Sinno-jialin-pan and . Yang, A survey on transfer learning, IEEE Transactions on knowledge and data engineering, vol.22, issue.10, pp.1345-1359, 2010.

I. Pratikakis, K. Zagoris, G. Barlas, and B. Gatos, Proceedings of the 15th International Conference on Frontiers in Handwriting Recognition (ICFHR'16), pp.619-623, 2016.

J. A. Sanchez, V. Romero, A. H. Toselli, M. Villegas, and E. Vidal, ICDAR2017 Competition on Handwritten Text Recognition on the READ Dataset, Proceedings of the 14th International Conference on Document Analysis and Recognition (ICDAR'17), pp.1383-1388, 2017.

C. Vania and A. Lopez, From Characters to Words to in Between: Do We Capture Morphology?, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL'17), pp.2016-2027, 2017.

O. Vinyals, A. Toshev, S. Bengio, and D. Erhan, Show and tell: A neural image caption generator, Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on, pp.3156-3164, 2015.