Krik: First Steps into Crowdsourcing POS tags for Kréyòl Gwadloupéyen

Abstract : This article presents the adaptation to Guadeloupean Creole of a project of crowdsourcing part-of-speech (POS) tags initially designed for a French regional language, Alsatian. We do not detail here the specifically developed crowdsourcing platform and methodology, but rather focus on the construction of the required elements for a language to be a candidate for this task: i) an open-source raw corpus, ii) a tokenizer, iii) adapted annotation guidelines, iv) a minimal reference, and, preferentially, v) one or two baseline tagger(s). After describing the preliminary work we have carried out for Guadeloupean Creole to comply with these prerequisites, we present the first results on crowdsourcing POS tags through the platform specifically developed for this task: Krik.
Type de document :
Communication dans un congrès
CCURL 2018 , May 2018, Miyazaki, Japan
Liste complète des métadonnées

Littérature citée [15 références]  Voir  Masquer  Télécharger
Contributeur : Karën Fort <>
Soumis le : dimanche 13 mai 2018 - 14:11:57
Dernière modification le : jeudi 21 mars 2019 - 12:01:18
Document(s) archivé(s) le : mardi 25 septembre 2018 - 07:01:15


Fichiers produits par l'(les) auteur(s)


  • HAL Id : hal-01790617, version 1


Alice Millour, Karën Fort. Krik: First Steps into Crowdsourcing POS tags for Kréyòl Gwadloupéyen. CCURL 2018 , May 2018, Miyazaki, Japan. 〈hal-01790617〉



Consultations de la notice


Téléchargements de fichiers