Learning Texture Features for Enhancement and Segmentation of Historical Document Images

Abstract : Many challenges and open issues related to the tremendous growth in digitizing collections of cultural heritage documents have been raised, such as information retrieval in digital libraries or analyzing page content of historical books. Recently, graphic/text segmentation in historical documents has posed specific challenges due to many particularities of historical document images (e.g. noise and degradation, presence of handwriting, overlapping layouts, great variability of page layout). To cope with those challenges, a method based on learning texture features for historical document image enhancement and segmentation is proposed in this article. The proposed method is based on using the simple linear iterative clustering (SLIC) superpixels, Gabor de-scriptors and support vector machines (SVM). It has been evaluated on 100 document images which have been selected from the databases of the competitions (i.e. historical document layout analysis and historical book recognition) in the context of ICDAR conference and HIP workshop (2011 and 2013). To demonstrate the enhancement and segmentation quality, the evaluation is based on manually labeled ground truth and shows the effectiveness of the proposed method through qualitative and numerical experiments. The proposed method provides interesting results on historical document images having various page layouts and different typographical and graphical properties.
Type de document :
Communication dans un congrès
ACM. International Workshop on Historical Document Imaging and Processing (HIP), Aug 2015, Nancy, France. pp.47-54, 2015
Liste complète des métadonnées

Littérature citée [26 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01237228
Contributeur : Maroua Mehri <>
Soumis le : mercredi 2 décembre 2015 - 22:35:25
Dernière modification le : mercredi 11 octobre 2017 - 11:18:01
Document(s) archivé(s) le : jeudi 3 mars 2016 - 15:01:31

Fichier

MarouaMEHRI_HIP_2015.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01237228, version 1

Collections

Citation

Maroua Mehri, Nibal Nayef, Pierre Héroux, Petra Gomez-Krämer, Rémy Mullot. Learning Texture Features for Enhancement and Segmentation of Historical Document Images. ACM. International Workshop on Historical Document Imaging and Processing (HIP), Aug 2015, Nancy, France. pp.47-54, 2015. 〈hal-01237228〉

Partager

Métriques

Consultations de
la notice

129

Téléchargements du document

60