Journal article, Language Resources and Evaluation, Year: 2021

TEI-friendly annotation scheme for medieval named entities: a case on a Spanish medieval corpus

Abstract

Medieval documents are a rich source of historical data. Performing named-entity recognition (NER) on this genre of texts can provide us with valuable historical evidence. However, traditional NER categories and schemes are usually designed with modern documents in mind (e.g. journalistic text), and general-domain NER annotation schemes fail to capture the nature of medieval entities. In this paper we explore the challenges of performing named-entity annotation on a corpus of Spanish medieval documents: we discuss the mismatches that arise when applying traditional NER categories to this corpus, and we propose a novel humanist-friendly, TEI-compliant annotation scheme and guidelines intended to capture the particular nature of medieval entities.

Dates and versions

hal-03226581, version 1 (14-05-2021)

Cite

Elena Álvarez Mellado, María Luisa Díez Platas, Pablo Ruiz Fabo, Helena Bermúdez Sabel, Salvador Ros Muñoz, et al. TEI-friendly annotation scheme for medieval named entities: a case on a Spanish medieval corpus. Language Resources and Evaluation, 2021, ⟨10.1007/s10579-020-09516-2⟩. ⟨hal-03226581⟩

Collections

SITE-ALSACE
