Skip to Main content Skip to Navigation
Conference papers

Combination of deep learning and syntactical approaches for the interpretation of interactions between text-lines and tabular structures in handwritten documents

Abstract : In this article, we present our work on baseline detection in images of historical documents. This work focuses on handwritten documents containing tabular structures. One of the difficulties of this kind of documents is the strong interaction between text and tabular structures. This interaction leads to ambiguous cases for which recognition systems often over-or sub-segment baselines. The interest of our method is to combine contextual and structural knowledge in order to interpret properly this interaction. Our combination is able to merge heterogeneous information obtained with a deep-learning approach (for contextual elements) and a syntactical approach (for structural elements). Our grammatical description consists on a logical description of the intersections between text-lines and vertical rulings of detected tables. Intersections are described thanks to physical indicators extracted from images: vertical rulings, hypothetical text-lines, begin-and end-indicators of text-lines. We show on cBAD competition [4] (competition on baseline detection) that the combination of heterogeneous knowledge (structural and contextual information) improves baseline detection in handwritten documents. We obtain better scores than the best method published until now on this competition.
Complete list of metadatas

Cited literature [10 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-02303293
Contributor : Camille Guerry <>
Submitted on : Wednesday, October 2, 2019 - 10:58:09 AM
Last modification on : Thursday, August 20, 2020 - 12:46:03 AM

File

Combination of deep learning a...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02303293, version 1

Citation

Camille Guerry, Bertrand Coüasnon, Aurélie Lemaitre. Combination of deep learning and syntactical approaches for the interpretation of interactions between text-lines and tabular structures in handwritten documents. 15th International Conference on Document Analysis and Recognition (ICDAR), Sep 2019, Sydney, Australia. ⟨hal-02303293⟩

Share

Metrics

Record views

105

Files downloads

211