Combination of deep learning and syntactical approaches for the interpretation of interactions between text-lines and tabular structures in handwritten documents - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2019

Combination of deep learning and syntactical approaches for the interpretation of interactions between text-lines and tabular structures in handwritten documents

Résumé

In this article, we present our work on baseline detection in images of historical documents. This work focuses on handwritten documents containing tabular structures. One of the difficulties of this kind of documents is the strong interaction between text and tabular structures. This interaction leads to ambiguous cases for which recognition systems often over-or sub-segment baselines. The interest of our method is to combine contextual and structural knowledge in order to interpret properly this interaction. Our combination is able to merge heterogeneous information obtained with a deep-learning approach (for contextual elements) and a syntactical approach (for structural elements). Our grammatical description consists on a logical description of the intersections between text-lines and vertical rulings of detected tables. Intersections are described thanks to physical indicators extracted from images: vertical rulings, hypothetical text-lines, begin-and end-indicators of text-lines. We show on cBAD competition [4] (competition on baseline detection) that the combination of heterogeneous knowledge (structural and contextual information) improves baseline detection in handwritten documents. We obtain better scores than the best method published until now on this competition.
Fichier principal
Vignette du fichier
Combination of deep learning and syntactical approaches for the interpretation of interactions between text-lines and tabular structures in handwritten documents.pdf (23.66 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02303293 , version 1 (02-10-2019)

Identifiants

  • HAL Id : hal-02303293 , version 1

Citer

Camille Guerry, Bertrand B. Coüasnon, Aurélie Lemaitre. Combination of deep learning and syntactical approaches for the interpretation of interactions between text-lines and tabular structures in handwritten documents. 15th International Conference on Document Analysis and Recognition (ICDAR), Sep 2019, Sydney, Australia. ⟨hal-02303293⟩
125 Consultations
28 Téléchargements

Partager

Gmail Facebook X LinkedIn More