Annotating a large corpus with anaphoric links

Abstract : This paper presents a one million word French corpus annotated with anaphoric links. The anaphoric expressions selected are mainly grammatical discourse phenomena for which a reliable annotation could be provided. The annotation scheme, defined in XML, encodes the orientation of the anaphoric relation by using a specific element for relating the anaphoric expression to its antecedent(s). A set of five semantic relations is used to type the anaphoric relation. As a rule, linguistic expressions selected are phrases, but the annotation scheme uses specific elements to deal with descriptive anaphors which occur in nominal ellipses and demonstrative anaphors. Special cases such as multiple antecedents, discontinuous elements or ambiguity are discussed.
Type de document :
Communication dans un congrès
Third International Conference on Discourse Anaphora and Anaphor Resolution (DAARC2000), 2000, United Kingdom. pp.2, 2000
Liste complète des métadonnées

Littérature citée [6 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-00373327
Contributeur : François Trouilleux <>
Soumis le : vendredi 3 avril 2009 - 21:00:38
Dernière modification le : lundi 15 janvier 2018 - 21:46:02
Document(s) archivé(s) le : jeudi 10 juin 2010 - 18:09:23

Fichier

tutin_daarc2000.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00373327, version 1

Collections

Citation

Agnès Tutin, François Trouilleux, Catherine Clouzot, Éric Gaussier, Annie Zaenen, et al.. Annotating a large corpus with anaphoric links. Third International Conference on Discourse Anaphora and Anaphor Resolution (DAARC2000), 2000, United Kingdom. pp.2, 2000. 〈hal-00373327〉

Partager

Métriques

Consultations de la notice

244

Téléchargements de fichiers

319