Annotating Football Matches: Influence of the Source Medium on Manual Annotation

Karen Fort 1 Vincent Claveau 2
LIPN - Laboratoire d'Informatique de Paris-Nord
2 TEXMEX - Multimedia content-based indexing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : In this paper, we present an annotation campaign of football (soccer) matches, from a heterogeneous text corpus of both match minutes and video commentary transcripts, in French. The data, annotations and evaluation process are detailed, and the quality of the annotated corpus is discussed. In particular, we propose a new technique to better estimate the annotator agreement when few elements of a text are to be annotated. Based on that, we show how the source medium influenced the process and the quality.
Document type :
Conference papers
Complete list of metadatas

Cited literature [16 references]  Display  Hide  Download
Contributor : Karën Fort <>
Submitted on : Monday, June 18, 2012 - 9:55:12 AM
Last modification on : Friday, September 6, 2019 - 11:48:09 AM
Long-term archiving on : Wednesday, September 19, 2012 - 2:31:00 AM


Files produced by the author(s)


  • HAL Id : hal-00709170, version 1


Karen Fort, Vincent Claveau. Annotating Football Matches: Influence of the Source Medium on Manual Annotation. LREC - Eight International Conference on Language Resources and Evaluation, May 2012, Istanbul, Turkey. ⟨hal-00709170⟩



Record views


Files downloads