ANCOR_Centre, a Large Free Spoken French Coreference Corpus: description of the Resource and Reliability Measures

Judith Muzerelle 1 Anaïs Lefeuvre 2 Emmanuel Schang 1 Jean-Yves Antoine 2 Aurore Pelletier 1 Denis Maurel 2 Iris Eshkol 1 Jeanne Villaneau 3
2 BDTLN - Bases de données et traitement des langues naturelles
LI - Laboratoire d'Informatique de l'Université de Tours
3 SEASIDE - SEarch, Analyze, Synthesize and Interact with Data Ecosystems
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, UBS - Université de Bretagne Sud
Abstract : This article presents ANCOR_Centre, a French coreference corpus, available under the Creative Commons Licence. With a size of around 500,000 words, the corpus is large enough to serve the needs of data-driven approaches in NLP and represents one of the largest coreference resources currently available. The corpus focuses exclusively on spoken language, it aims at representing a certain variety of spoken genders. ANCOR_Centre includes anaphora as well as coreference relations which involve nominal and pronominal mentions. The paper describes into details the annotation scheme and the reliability measures computed on the resource.
Type de document :
Communication dans un congrès
ELRA. LREC'2014, 9th Language Resources and Evaluation Conference., May 2014, Reyjavik, Iceland. pp.MUZERELLE14.150, 2014, <http://www.lrec-conf.org/proceedings/lrec2014/index.html>


https://hal.archives-ouvertes.fr/hal-01075679
Contributeur : Jean-Yves Antoine <>
Soumis le : dimanche 19 octobre 2014 - 15:57:57
Dernière modification le : jeudi 20 octobre 2016 - 11:58:46
Document(s) archivé(s) le : mardi 20 janvier 2015 - 10:44:20

Fichier

2014_LREC_ANCOR.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01075679, version 1

Citation

Judith Muzerelle, Anaïs Lefeuvre, Emmanuel Schang, Jean-Yves Antoine, Aurore Pelletier, et al.. ANCOR_Centre, a Large Free Spoken French Coreference Corpus: description of the Resource and Reliability Measures. ELRA. LREC'2014, 9th Language Resources and Evaluation Conference., May 2014, Reyjavik, Iceland. pp.MUZERELLE14.150, 2014, <http://www.lrec-conf.org/proceedings/lrec2014/index.html>. <hal-01075679>

Exporter

Partager

Métriques

Consultations de
la notice

227

Téléchargements du document

124