AColDPS : Robust and Unsupervised Automatic Color Document Processing System

Louisa Kessi 1, Frank Le Bourgeois 1, Christophe Garcia 1, Jean Duong 1
1 imagine - Extraction de Caractéristiques et Identification
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract : This paper presents the first fully automatic color analysis system suited for business documents. Our pixel-based approach relies mainly on color morphology and requires no training, manual assistance, prior knowledge or model. We developed a robust color segmentation system adapted to invoices and forms with significant color complexity and dithered backgrounds. The system performs several operations: it automatically segments color images, separates text from noise and graphics, and provides information about text color. The contribution of our work is threefold. First, we use color morphology to segment text and inverted text simultaneously. Our system processes inverted and non-inverted text automatically using conditional color dilation and erosion, even where the two overlap. Second, we extract geodesic measures using morphological convolution in order to separate text, noise and graphical elements. Third, we develop a method to disconnect characters that touch or overlap graphical elements. Our system can separate characters that touch straight lines, split overlapping characters of different colors, and separate characters from graphics when their colors differ. A color analysis stage automatically calculates the number of character colors. The proposed system is generic enough to process a wide range of digitized business document images from different origins. It outperforms the classical approach based on binarization of greyscale images.
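As an illustration of the conditional dilation mentioned in the abstract, here is a minimal sketch of the generic idea on a binary image: a marker is dilated repeatedly, and each result is restricted to a mask until nothing changes. This is not the authors' color operator. The binary simplification, the 4-connected structuring element, and the names `dilate4` and `conditional_dilation` are all assumptions made for the example.

```python
from typing import List

Grid = List[List[int]]  # binary image as rows of 0/1 values (assumed representation)


def dilate4(img: Grid) -> Grid:
    """Binary dilation with a 4-connected (cross-shaped) structuring element."""
    h, w = len(img), len(img[0])
    out = [row[:] for row in img]
    for y in range(h):
        for x in range(w):
            if img[y][x]:
                for dy, dx in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                    ny, nx = y + dy, x + dx
                    if 0 <= ny < h and 0 <= nx < w:
                        out[ny][nx] = 1
    return out


def conditional_dilation(marker: Grid, mask: Grid) -> Grid:
    """Grow `marker` inside `mask` until stable (hypothetical helper name)."""
    # Start from the part of the marker that lies inside the mask.
    current = [[m & k for m, k in zip(mr, kr)] for mr, kr in zip(marker, mask)]
    while True:
        grown = dilate4(current)
        # The "conditional" step: never let the growth leave the mask.
        nxt = [[g & k for g, k in zip(gr, kr)] for gr, kr in zip(grown, mask)]
        if nxt == current:
            return current
        current = nxt


# Toy data: the mask holds three separate shapes, the marker touches only one.
mask = [
    [1, 1, 0, 0, 1],
    [0, 1, 0, 0, 1],
    [0, 1, 0, 0, 0],
    [0, 0, 0, 1, 1],
]
marker = [
    [1, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
]
result = conditional_dilation(marker, mask)
for row in result:
    print(row)
```

Only the shape touched by the marker is recovered; the other two shapes in the mask are left out. A color version would replace the binary AND with a color-similarity condition, which this sketch does not attempt.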
Document type :
Conference papers

Cited literature : 14 references

https://hal.archives-ouvertes.fr/hal-01272989
Contributor : Louisa Kessi
Submitted on : Tuesday, February 16, 2016 - 3:11:37 PM
Last modification on : Tuesday, February 26, 2019 - 11:20:49 AM
Long-term archiving on : Tuesday, May 17, 2016 - 10:05:40 AM

File

VISAPP_2015_258_CR.pdf
Files produced by the author(s)

Citation

Louisa Kessi, Frank Le Bourgeois, Christophe Garcia, Jean Duong. AColDPS : Robust and Unsupervised Automatic Color Document Processing System. VISAPP 2015, 10th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, Mar 2015, Berlin, Germany. ⟨10.5220/0005315801740185⟩. ⟨hal-01272989⟩
