Ground-Truth Production and Benchmarking Scenarios Creation with DocMining - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2003

Ground-Truth Production and Benchmarking Scenarios Creation with DocMining

Résumé

In this paper we present the DocMining platform and its application to ground-truth datasets production and page segmentation evaluation. DocMining is a highly modular framework dedicated to document interpretation where document processing tasks are modelized with scenarios. We present here two scenarios which use PDF documents, found on the web or produced from XML files, as basis of the ground-truth dataset.
Fichier principal
Vignette du fichier
10.1.1.59.1985.pdf (761.22 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00637065 , version 1 (29-10-2011)

Identifiants

  • HAL Id : hal-00637065 , version 1

Citer

Eric Clavier, Pierre Héroux, Joël Gardes, Eric Trupin. Ground-Truth Production and Benchmarking Scenarios Creation with DocMining. International Workshop on Document Layout and Image Analysis, 2003, Edimburgh, United Kingdom. pp.31--35. ⟨hal-00637065⟩
106 Consultations
175 Téléchargements

Partager

Gmail Facebook X LinkedIn More