Complex documents images segmentation based on steerable pyramid features

Mohamed Benjelil; Slim Kanoun; Rémy Mullot; Adel Alimi

doi:10.1007/s10032-010-0113-9

Journal article, International Journal on Document Analysis and Recognition (IJDAR), Year: 2010

Mohamed Benjelil

Role: Author

REsearch Group in Intelligent Machines [Sfax]

Slim Kanoun

Role: Author

REsearch Group in Intelligent Machines [Sfax]

Rémy Mullot

Role: Author

Laboratoire Informatique, Image et Interaction - EA 2118

Adel Alimi

Role: Author
PersonId: 756240
ORCID: 0000-0002-0642-3384

REsearch Group in Intelligent Machines [Sfax]

Abstract

Page segmentation and classification are important steps in a document layout analysis system, performed before a document is passed to an OCR system or to any other subsequent processing step. In this paper, we propose an accurate system designed for the segmentation of complex documents. The system is based on the steerable pyramid transform. Features extracted from the pyramid sub-bands are used to locate regions and classify them as text (machine-printed or handwritten) or non-text (images, graphics, drawings or paintings) in noisy, deformed, multilingual, multi-script document images. These documents contain tabular structures, logos, stamps, handwritten script blocks, photographs, etc. We present encouraging results obtained on a data set of 1,000 official complex document images and compare them with those of existing state-of-the-art methods. The comparison shows that the proposed method performs consistently well on large sets of complex document images.
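The abstract does not specify the filters or the features used, so the following is only a minimal sketch of the general idea of multi-scale, multi-orientation sub-band features. It is not the authors' implementation and not a full steerable pyramid: it assumes first-order Gaussian derivative filters (which are steerable, since the response at angle θ is cos θ times the x-response plus sin θ times the y-response) and uses the mean energy of each oriented band as a feature. All function names and parameters (`oriented_subband_energies`, `levels`, `n_orient`, `sigma`, `radius`) are hypothetical.

```python
import numpy as np


def gaussian_derivative_kernels(sigma=1.0, radius=3):
    """Return a Gaussian kernel and its first derivatives along x and y."""
    ax = np.arange(-radius, radius + 1, dtype=float)
    xx, yy = np.meshgrid(ax, ax)  # xx varies along columns, yy along rows
    g = np.exp(-(xx**2 + yy**2) / (2.0 * sigma**2))
    g /= g.sum()
    gx = -xx / sigma**2 * g
    gy = -yy / sigma**2 * g
    return g, gx, gy


def filter2d(image, kernel):
    """Same-size 2-D correlation with edge padding, using NumPy only."""
    r = kernel.shape[0] // 2
    padded = np.pad(image, r, mode="edge")
    windows = np.lib.stride_tricks.sliding_window_view(padded, kernel.shape)
    return np.einsum("ijkl,kl->ij", windows, kernel)


def oriented_subband_energies(image, levels=3, n_orient=4):
    """Mean energy of each oriented band at each scale (assumed feature)."""
    g, gx, gy = gaussian_derivative_kernels()
    current = np.asarray(image, dtype=float)
    features = []
    for _ in range(levels):
        dx = filter2d(current, gx)
        dy = filter2d(current, gy)
        for k in range(n_orient):
            theta = np.pi * k / n_orient
            # Steering: synthesize the response at angle theta from two bases.
            band = np.cos(theta) * dx + np.sin(theta) * dy
            features.append(float(np.mean(band**2)))
        # Low-pass and subsample by 2 to move to the next, coarser scale.
        current = filter2d(current, g)[::2, ::2]
    return np.array(features)


# Toy input: stripes whose intensity changes from row to row only.
stripes = np.tile((np.arange(64) // 4 % 2).astype(float)[:, None], (1, 64))
feats = oriented_subband_energies(stripes)
print(feats.reshape(3, 4))  # rows: scales, columns: orientations
```

On this toy image the energy concentrates in the band steered along the row direction, which is the kind of orientation cue that could help separate regular text lines from pictures. A region classifier would compute such a vector per block and feed it to a classifier chosen separately.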

Keywords

Steerable pyramid; Complex document segmentation; Multi-resolution analysis; Invariant features

Domains

Computer Science / Image Processing [eess.IV]

Contributor: Rémy Mullot

https://hal.science/hal-00495728

Submitted on: Monday, 28 June 2010, 16:10:38

Last modified on: Thursday, 12 May 2022, 15:35:57

Dates and versions

hal-00495728, version 1 (28-06-2010)

Identifiers

HAL Id: hal-00495728, version 1
DOI: 10.1007/s10032-010-0113-9

Cite

Mohamed Benjelil, Slim Kanoun, Rémy Mullot, Adel Alimi. Complex documents images segmentation based on steerable pyramid features. International Journal on Document Analysis and Recognition (IJDAR), 2010, 13 (1), pp.Online ISSN 1433-2833. ⟨10.1007/s10032-010-0113-9⟩. ⟨hal-00495728⟩

Collections

L3I UNIV-ROCHELLE
