Classification of business documents for real time application

Djamel Gaceb (1), Véronique Eglin (1), Frank Le Bourgeois (1)
(1) imagine - Extraction de Caractéristiques et Identification
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract : In this paper, we present a new document classification method based on physical layout features and graph b-coloring modeling. To reduce computation time and increase the performance of our automatic reading system, we propose to pre-classify business documents by introducing an Automatic Recognition of Documents (ARD) stage as a pre-analysis phase. This phase guides the other phases involved in recognizing the document contents. Once the document type is identified, the reading system uses the corresponding information source to improve the recognition of the logical layout, the selection and parameterization of the OCR, and the final sorting decision. The graph coloring model is introduced for both layout analysis and document classification. The proposed method is reliable, robust to various constraints, and guarantees a real-time response for the sorting of business documents.
Document type :
Journal articles
Contributor : Équipe Gestionnaire Des Publications Si Liris
Submitted on : Monday, April 11, 2016 - 2:56:09 PM
Last modification on : Thursday, February 7, 2019 - 2:27:28 PM



Djamel Gaceb, Véronique Eglin, Frank Le Bourgeois. Classification of business documents for real time application. Journal of Real-Time Image Processing, Springer Verlag, 2014, 9, pp.329-345. ⟨10.1007/s11554-011-0227-4⟩. ⟨hal-01300863⟩


