Learning text-line localization with shared and local regression neural networks - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2016

Learning text-line localization with shared and local regression neural networks

Jérôme Louradour
  • Fonction : Auteur
Christopher Kermorvant

Résumé

Text line detection and localisation is a crucial step for full page document analysis, but still suffers from heterogeneity of real life documents. In this paper, we present a novel approach for text line localisation based on Convolutional Neural Networks and Multidimensional Long Short-Term Memory cells as a regressor in order to predict the coordinates of the text line bounding boxes directly from the pixel values. Targeting typically large images in document image analysis, we propose a new model using weight sharing over local blocks. We compare two strategies: directly predicting the four coordinates or predicting lower-left and upper-right points separately followed by matching. We evaluate our work on the highly unconstrained Maurdor dataset and show that our method outperforms both other machine learning and image processing methods.
Fichier non déposé

Dates et versions

hal-01345713 , version 1 (15-07-2016)

Identifiants

  • HAL Id : hal-01345713 , version 1

Citer

Bastien Moysset, Jérôme Louradour, Christopher Kermorvant, Christian Wolf. Learning text-line localization with shared and local regression neural networks. International Conference on Frontiers in Handwriting Recognition, Oct 2016, Shenzhen, China. ⟨hal-01345713⟩
189 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More