Handwritten word preprocessing for database adaptation

Cristina Oprean; Laurence Likforman-Sulem; Chafic Mokbel

Communication Dans Un Congrès Proceedings of SPIE, the International Society for Optical Engineering Année : 2013

Handwritten word preprocessing for database adaptation

(1) , (1) , (2)

1
2

Cristina Oprean

Fonction : Auteur
PersonId : 952962

Laboratoire Traitement et Communication de l'Information

Laurence Likforman-Sulem

Fonction : Auteur
PersonId : 180333
IdHAL : laurence-likforman-sulem

Laboratoire Traitement et Communication de l'Information

Chafic Mokbel

Fonction : Auteur

University of Balamand [Liban]

Résumé

Handwriting recognition systems are typically trained using publicly available databases, where data have been collected in controlled conditions (image resolution, paper background, noise level, etc.). Since this is not often the case in real-world scenarios, classification performance can be affected when novel data is presented to the word recognition system. To overcome this problem, we present in this paper a new approach called database adaptation. It consists of processing one set (training or test) in order to adapt it to the other set (test or training, respectively). Specifically, two kinds of preprocessing, namely stroke thickness normalization and pixel intensity normalization are considered. The advantage of such approach is that we can re-use the existing recognition system trained on controlled data. We conduct several experiments with the Rimes 2011 word database and with a real-world database. We adapt either the test set or the training set. Results show that training set adaptation achieves better results than test set adaptation, at the cost of a second training stage on the adapted data. Accuracy of data set adaptation is increased by 2% to 3% in absolute value over no adaptation.

Mots clés

Handwritten word recognition database adaptation word preprocessing

Domaines

Intelligence artificielle [cs.AI] Multimédia [cs.MM] Recherche d'information [cs.IR] Traitement des images [eess.IV] Traitement du texte et du document Web

Fichier principal

article_final.pdf (280.64 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Cristina Oprean : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00948976

Soumis le : mardi 18 février 2014-18:39:21

Dernière modification le : lundi 9 octobre 2023-12:49:40

Archivage à long terme le : dimanche 18 mai 2014-12:16:33

Dates et versions

hal-00948976 , version 1 (18-02-2014)

Identifiants

HAL Id : hal-00948976 , version 1

Citer

Cristina Oprean, Laurence Likforman-Sulem, Chafic Mokbel. Handwritten word preprocessing for database adaptation. Document Recognition and Retrieval XX, Feb 2013, San Francisco, United States. pp.865808-865808-9. ⟨hal-00948976⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSTITUT-TELECOM CNRS PARISTECH LTCI IDS S2A

133 Consultations

242 Téléchargements

Handwritten word preprocessing for database adaptation

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager