Developing an annotator for Latin texts using Wikipedia - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Journal of Data Mining and Digital Humanities Année : 2017

Developing an annotator for Latin texts using Wikipedia

Résumé

This work investigates the feasibility of using Wikipedia as a resource for annotations of Latin texts. Although Wikipedia is an excellent resource from which to extract many kinds of information (morphological, syntactic and semantic) to be used in NLP tasks on modern languages, it was rarely applied to perform NLP tasks for the Latin language. The work presents the first steps of the development of a POS Tagger based on the Latin version of Wiktionary and a Wikipedia-based semantic annotator.
Fichier principal
Vignette du fichier
Developing an annotator for Latin texts using Wikipedia.pdf (311.22 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte

Dates et versions

hal-01279853 , version 1 (03-03-2016)
hal-01279853 , version 2 (30-11-2017)

Licence

Paternité

Identifiants

  • HAL Id : hal-01279853 , version 2

Citer

Raffaele Guarasci. Developing an annotator for Latin texts using Wikipedia. Journal of Data Mining and Digital Humanities, In press. ⟨hal-01279853v2⟩
269 Consultations
248 Téléchargements

Partager

Gmail Facebook X LinkedIn More