Conference paper, Year: 2000

Annotating a large corpus with anaphoric links

Agnès Tutin
Catherine Clouzot
  • Role: Author
Éric Gaussier
Stéphanie Rayot
  • Role: Author
Georges Antoniadis

Abstract

This paper presents a one-million-word French corpus annotated with anaphoric links. The anaphoric expressions selected are mainly grammatical discourse phenomena for which a reliable annotation could be provided. The annotation scheme, defined in XML, encodes the orientation of the anaphoric relation by using a specific element to relate the anaphoric expression to its antecedent(s). A set of five semantic relations is used to type the anaphoric relation. As a rule, the linguistic expressions selected are phrases, but the annotation scheme uses specific elements to deal with descriptive anaphors, which occur in nominal ellipses and demonstrative anaphors. Special cases such as multiple antecedents, discontinuous elements, and ambiguity are discussed.
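To make the idea of a separate linking element concrete, here is a minimal sketch in Python. It is not the paper's actual scheme: the element names `exp` and `link`, the attributes `id`, `src`, `tgt`, and `type`, the relation label `coref`, and the sample sentences are all assumptions made for illustration. It only shows how a link element can point from an anaphoric expression to one or several antecedents and carry a relation type.

```python
import xml.etree.ElementTree as ET

# Hypothetical markup (not the scheme from the paper):
#   <exp id="...">   marks a linguistic expression
#   <link src tgt type> relates an anaphor (src) to its antecedent(s) (tgt,
#   space-separated ids) and types the relation.
SAMPLE = """
<text>
  <exp id="e1">Marie</exp> est arrivée. <exp id="e2">Elle</exp> a salué
  <exp id="e3">Paul</exp>. <exp id="e4">Ils</exp> sont partis.
  <link src="e2" tgt="e1" type="coref"/>
  <link src="e4" tgt="e1 e3" type="coref"/>
</text>
"""


def read_links(xml_string):
    """Return (anaphor, antecedents, relation) triples from the markup."""
    root = ET.fromstring(xml_string.strip())
    # Map each expression id to its surface text.
    expressions = {e.get("id"): (e.text or "").strip() for e in root.iter("exp")}
    links = []
    for link in root.iter("link"):
        anaphor = expressions[link.get("src")]
        # A link may name several antecedents (e.g. a plural pronoun).
        antecedents = [expressions[i] for i in link.get("tgt").split()]
        links.append((anaphor, antecedents, link.get("type")))
    return links


links = read_links(SAMPLE)
for anaphor, antecedents, relation in links:
    print(f"{anaphor} -> {antecedents} ({relation})")
```

Keeping the link in its own element, rather than as an attribute on the anaphor, is one way to make the direction of the relation explicit and to allow several antecedents for a single anaphor.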
Main file
tutin_daarc2000.pdf (114.88 KB)
Origin: files produced by the author(s)

Dates and versions

hal-00373327, version 1 (03-04-2009)

Identifiers

  • HAL Id: hal-00373327, version 1

Cite

Agnès Tutin, François Trouilleux, Catherine Clouzot, Éric Gaussier, Annie Zaenen, et al. Annotating a large corpus with anaphoric links. Third International Conference on Discourse Anaphora and Anaphor Resolution (DAARC2000), 2000, United Kingdom. pp.2. ⟨hal-00373327⟩

Collections

PRES_CLERMONT