Modeling the Complexity of Manual Annotation Tasks: a Grid of Analysis

Karën Fort 1, 2 Adeline Nazarenko 2 Sophie Rosset 3
1 SEMAGRAMME - Semantic Analysis of Natural Language
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
2 RCLN
LIPN - Laboratoire d'Informatique de Paris-Nord
Abstract : Manual corpus annotation is getting widely used in Natural Language Processing (NLP). While being recognized as a difficult task, no in-depth analysis of its complexity has been performed yet. We provide in this article a grid of analysis of the different complexity dimensions of an annotation task, which helps estimating beforehand the difficulties and cost of annotation campaigns. We observe the applicability of this grid on existing annotation campaigns and detail its application on a real-world example.
Type de document :
Communication dans un congrès
International Conference on Computational Linguistics, Dec 2012, Mumbaï, India. pp.895--910, 2012
Liste complète des métadonnées

Littérature citée [24 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-00769631
Contributeur : Karën Fort <>
Soumis le : mercredi 2 janvier 2013 - 16:37:59
Dernière modification le : mardi 18 décembre 2018 - 16:38:01
Document(s) archivé(s) le : vendredi 31 mars 2017 - 22:19:36

Fichier

coling2012_Complexity_KF_30102...
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00769631, version 1

Citation

Karën Fort, Adeline Nazarenko, Sophie Rosset. Modeling the Complexity of Manual Annotation Tasks: a Grid of Analysis. International Conference on Computational Linguistics, Dec 2012, Mumbaï, India. pp.895--910, 2012. 〈hal-00769631〉

Partager

Métriques

Consultations de la notice

665

Téléchargements de fichiers

311