Alpha-numerical sequences extraction in handwritten documents - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2010

Alpha-numerical sequences extraction in handwritten documents

Résumé

In this paper, we introduce an alpha-numerical sequences extraction system (keywords, numerical fields or alpha-numerical sequences) in unconstrained handwritten documents. Contrary to most of the approaches presented in the literature, our system relies on a global handwriting line model describing two kinds of information : i) the relevant information and ii) the irrelevant information represented by a shallow parsing model. The shallow parsing of isolated text lines allows quick information extraction in any document while rejecting at the same time irrelevant information. Results on a public french incoming mails database show the efficiency of the approach.
Fichier principal
Vignette du fichier
ICFHR2010Thomasim.pdf (345.84 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00488290 , version 1 (01-06-2010)

Identifiants

  • HAL Id : hal-00488290 , version 1

Citer

Simon Thomas, Clement Chatelain, Laurent Heutte, Thierry Paquet. Alpha-numerical sequences extraction in handwritten documents. ICFHR, Nov 2010, India. pp.6. ⟨hal-00488290⟩
34 Consultations
95 Téléchargements

Partager

Gmail Facebook X LinkedIn More