Modeling the Complexity of Manual Annotation Tasks: a Grid of Analysis

Karën Fort 1, 2 Adeline Nazarenko 2 Sophie Rosset 3
1 SEMAGRAMME - Semantic Analysis of Natural Language
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
LIPN - Laboratoire d'Informatique de Paris-Nord
Abstract : Manual corpus annotation is getting widely used in Natural Language Processing (NLP). While being recognized as a difficult task, no in-depth analysis of its complexity has been performed yet. We provide in this article a grid of analysis of the different complexity dimensions of an annotation task, which helps estimating beforehand the difficulties and cost of annotation campaigns. We observe the applicability of this grid on existing annotation campaigns and detail its application on a real-world example.
Document type :
Conference papers
Liste complète des métadonnées

Cited literature [24 references]  Display  Hide  Download
Contributor : Karën Fort <>
Submitted on : Wednesday, January 2, 2013 - 4:37:59 PM
Last modification on : Wednesday, April 3, 2019 - 1:23:00 AM
Document(s) archivé(s) le : Friday, March 31, 2017 - 10:19:36 PM


Files produced by the author(s)


  • HAL Id : hal-00769631, version 1


Karën Fort, Adeline Nazarenko, Sophie Rosset. Modeling the Complexity of Manual Annotation Tasks: a Grid of Analysis. International Conference on Computational Linguistics, Dec 2012, Mumbaï, India. pp.895--910. ⟨hal-00769631⟩



Record views


Files downloads