Developing an annotator for Latin texts using Wikipedia

Abstract: This work investigates the feasibility of using Wikipedia as a resource for annotating Latin texts. Although Wikipedia is an excellent resource from which to extract many kinds of information (morphological, syntactic, and semantic) for NLP tasks on modern languages, it has rarely been applied to NLP tasks for Latin. The work presents the first steps in the development of a POS tagger based on the Latin version of Wiktionary and of a Wikipedia-based semantic annotator.
Document type:
Journal article
Journal of Data Mining and Digital Humanities, Episciences.org, In press

https://hal.archives-ouvertes.fr/hal-01279853
Contributor: Raffaele Guarasci
Submitted on: Thursday, 30 November 2017 - 16:57:56
Last modified on: Thursday, 7 December 2017 - 01:04:31
Document(s) archived on: Thursday, 1 March 2018 - 12:13:24

File

Developing an annotator for La...
Publisher files allowed on an open archive

License

Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

  • HAL Id : hal-01279853, version 2

Citation

Raffaele Guarasci. Developing an annotator for Latin texts using Wikipedia. Journal of Data Mining and Digital Humanities, Episciences.org, In press. 〈hal-01279853v2〉

Metrics

Record views

52

File downloads

51