Introduction of statistical information in a syntactic analyser for document image recognition

André O. Maroneze 1 Bertrand Coüasnon 1 Aurélie Lemaitre 1
1 IMADOC - Interprétation et Reconnaissance d’Images et de Documents
UR1 - Université de Rennes 1, INSA Rennes - Institut National des Sciences Appliquées - Rennes, CNRS - Centre National de la Recherche Scientifique : UMR6074
Abstract : This paper presents an improvement to document layout analysis systems, offering a possible solution to Sayre's paradox (which states that an element must be recognized before it can be segmented, and must be segmented before it can be recognized). This improvement, based on stochastic parsing, allows statistical information obtained from recognizers to be integrated during syntactic layout analysis. We present how this fusion of numeric and symbolic information in a feedback loop can be applied to syntactic methods to improve the expressiveness of document descriptions. To limit combinatorial explosion during the exploration of solutions, we devised an operator that allows optional activation of the stochastic parsing mechanism. Our evaluation on 1250 handwritten business letters shows that this method improves global recognition scores.
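The abstract gives no implementation details, so the following is only a toy illustration of the general idea, not the authors' system. It shows a selection step that chooses among competing segmentation hypotheses using recognizer scores, with a flag that switches between taking the first acceptable hypothesis and scoring all of them to keep the best. Every name here (`first_match`, `best_match`, `choose`, `TOY_SCORES`, the `threshold` parameter) is hypothetical, and the scores are made-up values.

```python
from typing import Callable, Optional, Sequence, Tuple

# Hypothetical toy data: recognizer confidence for three candidate zones.
TOY_SCORES = {"zone_a": 0.6, "zone_b": 0.9, "zone_c": 0.2}

Scored = Tuple[str, float]


def first_match(hyps: Sequence[str], recognizer: Callable[[str], float],
                threshold: float = 0.5) -> Optional[Scored]:
    """Keep the first hypothesis whose score reaches the threshold."""
    for hyp in hyps:
        score = recognizer(hyp)
        if score >= threshold:
            return hyp, score
    return None


def best_match(hyps: Sequence[str], recognizer: Callable[[str], float],
               threshold: float = 0.5) -> Optional[Scored]:
    """Score every hypothesis and keep the highest-scoring acceptable one."""
    scored = [(hyp, recognizer(hyp)) for hyp in hyps]
    accepted = [pair for pair in scored if pair[1] >= threshold]
    return max(accepted, key=lambda pair: pair[1]) if accepted else None


def choose(hyps: Sequence[str], recognizer: Callable[[str], float],
           stochastic: bool = False, threshold: float = 0.5) -> Optional[Scored]:
    """Assumed stand-in for an optional-activation switch: exhaustive
    scoring is used only when `stochastic` is True."""
    pick = best_match if stochastic else first_match
    return pick(hyps, recognizer, threshold)


if __name__ == "__main__":
    hyps = list(TOY_SCORES)
    print(choose(hyps, TOY_SCORES.__getitem__))                   # ('zone_a', 0.6)
    print(choose(hyps, TOY_SCORES.__getitem__, stochastic=True))  # ('zone_b', 0.9)
```

With the flag off, only as many hypotheses as needed are scored. With it on, every hypothesis is scored, which is more costly but can recover a better candidate. This cost difference is why making the exhaustive mode optional helps contain the search.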
Document type :
Conference paper
Document recognition and Retrieval XVIII - Electronic Imaging, Jan 2011, San Francisco, United States. pp.7874 04, 2011

https://hal.archives-ouvertes.fr/hal-00567077
Contributor : Aurélie Lemaitre
Submitted on : Friday, February 18, 2011 - 10:03:05
Last modified on : Friday, November 16, 2018 - 01:27:54
Document(s) archived on : Tuesday, November 6, 2012 - 14:15:52

File

maronezeDRR.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00567077, version 1

Citation

André O. Maroneze, Bertrand Coüasnon, Aurélie Lemaitre. Introduction of statistical information in a syntactic analyser for document image recognition. Document recognition and Retrieval XVIII - Electronic Imaging, Jan 2011, San Francisco, United States. pp.7874 04, 2011. 〈hal-00567077〉

Metrics

Record views

297

File downloads

88