Journal article, INFOtheca: Journal of Information and Library Science, Year: 2014

Enrichment of Renaissance texts with proper names

Abstract

The Renom project proposes to enrich Renaissance texts with proper names. These texts present two new challenges: great diversity due to the variant spellings of words, and numerous XML-TEI tags that preserve the exact format of the original edition. The task consisted of adding Named Entity tags to this existing markup, generally relying on the left context and sometimes on the right context of a name. To do that, we improved CasSys, a free and open-source program that parses texts with Unitex graph cascades, and we built dictionaries and specific cascades. The slot error rate was 6.1%. Proper names and maps were to allow navigation within the texts. This paper therefore deals with Named Entity Recognition in Renaissance texts.
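As a rough illustration of tagging a name from its left context, the following Python sketch applies a short sequence of regular-expression passes, each working on the output of the previous one. It is only a loose analogue of a cascade and is not CasSys or Unitex. The trigger words, the `TRIGGERS` table, and the `tag_names` function are hypothetical; `persName` and `placeName` are standard TEI element names, but the abstract does not say which elements the project actually used.

```python
import re

# Hypothetical left-context triggers mapped to a TEI element name.
# The project's real dictionaries and graphs are not reproduced here.
TRIGGERS = {
    "roy": "persName",
    "roi": "persName",       # variant spelling of the same trigger
    "ville de": "placeName",
}

# A capitalized token, assumed here to be a candidate proper name.
CAP_WORD = r"[A-ZÀ-Ý][\w'-]*"


def tag_names(text):
    """Wrap the capitalized word that follows a trigger in a TEI element.

    Each trigger is applied in turn to the output of the previous pass.
    Markup already present in the text (for example <lb/>) is left untouched.
    """
    for trigger, element in TRIGGERS.items():
        pattern = re.compile(rf"\b({re.escape(trigger)})(\s+)({CAP_WORD})")

        def wrap(match, element=element):
            # Keep the trigger and the original whitespace, wrap only the name.
            return f"{match.group(1)}{match.group(2)}<{element}>{match.group(3)}</{element}>"

        text = pattern.sub(wrap, text)
    return text


if __name__ == "__main__":
    print(tag_names("le roy Françoys<lb/> entra en la ville de Lyon"))
```

Listing each spelling variant as its own trigger is the simplest way to cope with unstable orthography in a sketch like this; a real system would more plausibly rely on dictionaries, as the abstract indicates.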
Main file
Infoteka 2014.pdf (929.18 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01174733, version 1 (17-07-2015)

Identifiers

  • HAL Id: hal-01174733, version 1

Cite

Denis Maurel, Nathalie Friburger, Iris Eshkol-Taravella. Enrichment of Renaissance texts with proper names. INFOtheca: Journal of Information and Library Science, 2014, 15 (1), pp.15-27. ⟨hal-01174733⟩