A mixed approach for handwritten documents structural analysis

Vincent Malleron 1 Véronique Eglin 1
1 imagine - Extraction de Caractéristiques et Identification
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract : In this paper we propose a new method for document pages segmentation. First dedicated to handwritten documents, our method is designed to extract the different text zones, paragraph and fragment in unconstraint documents. The proposed approach is a mixed one, using both the advantages of top-down and bottom-up approaches. In this paper we proposed and evaluation of our methods on a 183 documents database, taken from a 19th century handwritten corpus : the "dossiers de Bouvard et Pécuchet" from Flaubert. With this evaluation we demonstrate that the combination of the top-down and the bottom-up approach allow to improve the obtained results.
Document type :
Conference papers
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-01354452
Contributor : Équipe Gestionnaire Des Publications Si Liris <>
Submitted on : Thursday, August 18, 2016 - 7:28:08 PM
Last modification on : Friday, January 11, 2019 - 5:08:46 PM

Identifiers

Citation

Vincent Malleron, Véronique Eglin. A mixed approach for handwritten documents structural analysis. International Conference on Document Analysis and Recognition, Sep 2011, Beijing, China. pp.269-273, ⟨10.1109/ICDAR.2011.62⟩. ⟨hal-01354452⟩

Share

Metrics

Record views

101