An Efficient Parametrization of Character Degradation Model for Semi-synthetic Image Generation

Abstract: This paper presents an efficient parametrization method for generating synthetic noise on document images. Given the desired categories and amount of noise, the method generates synthetic document images that exhibit most of the degradations observed in real document images (ink splotches, white specks or streaks). Because it can simulate different amounts and kinds of noise, it makes it possible to evaluate the robustness of many document image analysis methods. It can also generate data for algorithms that rely on a learning process. The degradation model presented in [7] needs eight parameters to randomly generate noise regions. We propose an extension of this model that sets the eight parameters automatically, so as to generate precisely what a user wants (amount and category of noise). Our proposal consists of three steps. First, Nsp seed-points (i.e. centres of noise regions) are selected by an adaptive procedure. Then, these seed-points are classified into three categories of noise using a heuristic rule. Finally, the size of each noise region is set by a random process in order to generate degradations that are as realistic as possible.
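The three steps named in the abstract can be illustrated with a minimal sketch. The abstract does not specify the adaptive selection procedure, the heuristic rule, or the size distribution, so everything below is an assumption made for illustration: the binary image encoding, the preference for seed-points on ink/background boundaries, the thresholds on local ink fraction, the hypothetical category labels, the uniform radius draw, and all function names. It is not the authors' implementation and does not model the eight parameters of [7].

```python
import random

INK, BACKGROUND = 1, 0  # assumed binary encoding of the document image


def ink_fraction(img, r, c):
    """Fraction of ink pixels in the 3x3 window around (r, c), clipped at borders."""
    h, w = len(img), len(img[0])
    window = [img[i][j]
              for i in range(max(0, r - 1), min(h, r + 2))
              for j in range(max(0, c - 1), min(w, c + 2))]
    return sum(window) / len(window)


def select_seed_points(img, n_sp, rng):
    """Step 1 (assumed rule): draw n_sp distinct pixels, preferring those whose
    neighbourhood mixes ink and background; fall back to all pixels otherwise."""
    h, w = len(img), len(img[0])
    all_pixels = [(r, c) for r in range(h) for c in range(w)]
    boundary = [p for p in all_pixels if 0 < ink_fraction(img, *p) < 1]
    pool = boundary if len(boundary) >= n_sp else all_pixels
    return rng.sample(pool, min(n_sp, len(pool)))


def classify_seed(img, r, c):
    """Step 2 (placeholder heuristic): three hypothetical categories,
    chosen by thresholding the local ink fraction."""
    f = ink_fraction(img, r, c)
    if f < 1 / 3:
        return "mostly_background"
    if f > 2 / 3:
        return "mostly_ink"
    return "mixed"


def plan_degradation(img, n_sp, max_radius=3, seed=0):
    """Step 3 (assumed distribution): give each noise region a random radius
    drawn uniformly from 1..max_radius, and return the full plan."""
    rng = random.Random(seed)
    plan = []
    for r, c in select_seed_points(img, n_sp, rng):
        plan.append({
            "centre": (r, c),
            "category": classify_seed(img, r, c),
            "radius": rng.randint(1, max_radius),
        })
    return plan


# Usage: an 8x8 toy "page" containing one 4x4 block of ink.
page = [[INK if 2 <= r <= 5 and 2 <= c <= 5 else BACKGROUND for c in range(8)]
        for r in range(8)]
for region in plan_degradation(page, n_sp=5):
    print(region)
```

The sketch only produces a plan (centre, category, radius) for each noise region; rendering the regions onto the image is left out. Fixing the seed makes the plan reproducible, which is convenient when the generated images are used to compare analysis methods.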
Document type:
Conference paper
2nd International Workshop on Historical Document Imaging and Processing, Aug 2013, Washington, DC, USA, United States

https://hal.archives-ouvertes.fr/hal-01006078
Contributor: Van Cuong Kieu
Submitted on: Friday, 13 June 2014 - 16:27:40
Last modified on: Thursday, 9 February 2017 - 16:58:52
Document(s) archived on: Saturday, 13 September 2014 - 11:30:21

File

Paper_15_crvHIP2013.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01006078, version 1

Citation

Van Cuong Kieu, Muriel Visani, Nicholas Journet, Rémy Mullot, Jean-Philippe Domenger. An Efficient Parametrization of Character Degradation Model for Semi-synthetic Image Generation. 2nd International Workshop on Historical Document Imaging and Processing, Aug 2013, Washington, DC, USA, United States. <hal-01006078>

Metrics

Record views: 332

Document downloads: 94