Transfer Learning for Handwriting Recognition on Historical Documents

Adeline Granet 1, 2, 3 Emmanuel Morin 1, 3 Harold Mouchère 1, 2 Solen Quiniou 1, 3 Christian Viard-Gaudin 1, 2
2 IPI - Image Perception Interaction
LS2N - Laboratoire des Sciences du Numérique de Nantes
3 TALN - Traitement Automatique du Langage Naturel
LS2N - Laboratoire des Sciences du Numérique de Nantes
Abstract : In this work, we investigate handwriting recognition on new historical handwritten documents using transfer learning. Establishing a manual ground-truth of a new collection of handwritten documents is time consuming but needed to train and to test recognition systems. We want to implement a recognition system without performing this annotation step. Our research deals with transfer learning from heterogeneous datasets with a ground-truth and sharing common properties with a new dataset that has no ground-truth. The main difficulties of transfer learning lie in changes in the writing style, the vocabulary, and the named entities over centuries and datasets. In our experiment, we show how a CNN-BLSTM-CTC neural network behaves, for the task of transcribing handwritten titles of plays of the Italian Comedy, when trained on combinations of various datasets such as RIMES, Georges Washington, and Los Esposalles. We show that the choice of the training datasets and the merging methods are determinant to the results of the transfer learning task.
Type de document :
Communication dans un congrès
International Conference on Pattern Recognition Applications and Methods, Jan 2018, Madeira, Portugal
Liste complète des métadonnées

https://hal.archives-ouvertes.fr/hal-01681126
Contributeur : Harold Mouchère <>
Soumis le : jeudi 11 janvier 2018 - 13:25:31
Dernière modification le : jeudi 1 février 2018 - 13:46:01

Identifiants

  • HAL Id : hal-01681126, version 1

Collections

Citation

Adeline Granet, Emmanuel Morin, Harold Mouchère, Solen Quiniou, Christian Viard-Gaudin. Transfer Learning for Handwriting Recognition on Historical Documents. International Conference on Pattern Recognition Applications and Methods, Jan 2018, Madeira, Portugal. 〈hal-01681126〉

Partager

Métriques

Consultations de la notice

90