Efficient Multiscale Sauvola's Binarization - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue International Journal on Document Analysis and Recognition Année : 2014

Efficient Multiscale Sauvola's Binarization

Résumé

This work focuses on the most commonly used binarization method: Sauvola's. It performs relatively well on classical documents, however, three main defects remain: the window parameter of Sauvola's formula does not fit automatically to the contents, it is not robust to low contrasts, and it is not invariant with respect to contrast inversion. Thus on documents such as magazines, the contents may not be retrieved correctly, which is crucial for indexing purpose. In this paper we describe how to implement an efficient multiscale implementation of Sauvola's algorithm in order to guarantee good binarization for both small and large objects inside a single document without adjusting manually the window size to the contents. We also describe how to implement it in an efficient way, step by step. This algorithm remains notably fast compared to the original one. For fixed parameters, text recognition rates and bi-narization quality are equal or better than other methods on text with low and medium x-height and is significantly improved on text with large x-height. Pixel-based accuracy and OCR evaluations are performed on more than 120 documents. Compared to awarded methods in the latest binarization contests, Sauvola's formula does not give the best results on historical documents. On the other hand, on clean magazines it out-performs those methods. This implementation improves the robustness of Sauv-ola's algorithm by making the results almost insensible to the window size whatever the object sizes. Its properties make it usable in full document analysis toolchains.
Fichier principal
Vignette du fichier
geraud.2014.ijdar.pdf (4.81 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02181880 , version 1 (12-07-2019)

Identifiants

Citer

Guillaume Lazzara, Thierry Géraud. Efficient Multiscale Sauvola's Binarization. International Journal on Document Analysis and Recognition, 2014, 17 (2), pp.105-123. ⟨10.1007/s10032-013-0209-0⟩. ⟨hal-02181880⟩
45 Consultations
948 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More