Automatic Subject Indexing and Classification Using Text Recognition and Computer-Based Analysis of Tables of Contents - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Proceedings of the ElPub Conference Année : 2018

Automatic Subject Indexing and Classification Using Text Recognition and Computer-Based Analysis of Tables of Contents

Jan Pokorny
  • Fonction : Auteur

Résumé

This paper will describe a method for machine-based creation of high quality subject indexing and classification for both electronic and print documents using tables of contents (ToCs). The technology described here is primarily focused on electronic and print documents for which, because of technical or licensing reasons, it is not possible to index full text. However, the technology would also be useful for full text documents, because it could significantly enhance the accuracy and relevance of subject description by analyzing the structure of ToCs.
Fichier principal
Vignette du fichier
PokornyJan_ELPUB2018.pdf (113.86 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01816705 , version 1 (15-06-2018)

Licence

Paternité

Identifiants

Citer

Jan Pokorny. Automatic Subject Indexing and Classification Using Text Recognition and Computer-Based Analysis of Tables of Contents. ELPUB 2018, Jun 2018, Toronto, Canada. ⟨10.4000/proceedings.elpub.2018.19⟩. ⟨hal-01816705⟩
339 Consultations
1228 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More