ANNODIS : une approche outillée de l'annotation de structures discursives

Abstract : The ANNODIS project has two interconnected objectives: to produce a corpus of texts annotated at discourse-level, and to develop tools for corpus annotation and exploitation. Two sets of annotations are proposed, representing two complementary perspectives on discourse organisation: a bottom-up approach starting from minimal discourse units and building complex structures via a set of discourse relations; a top-down approach envisaging the text as a whole and using pre-identified cues to detect discourse macro-structures. The construction of the corpus goes hand in hand with the development of two interfaces: the first one supports manual annotation of discourse structures, and allows different views of the texts using NLP-based pre-processing; another interface will support the exploitation of the annotations. We present the discourse models and annotation protocols, and the interface which embodies them.
Complete list of metadatas

Cited literature [10 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-00410590
Contributor : Marie-Paule Péry-Woodley <>
Submitted on : Friday, August 21, 2009 - 3:04:25 PM
Last modification on : Thursday, October 17, 2019 - 8:52:09 AM
Long-term archiving on : Tuesday, June 15, 2010 - 8:58:54 PM

File

TALN_52.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00410590, version 1

Citation

Marie-Paule Péry-Woodley, Nicholas Asher, Patrice Enjalbert, Farah Benamara, Myriam Bras, et al.. ANNODIS : une approche outillée de l'annotation de structures discursives. TALN 2009 (Conférence sur le Traitement Automatique des Langues Naturelles), Jun 2009, Senlis, France. paper_TALN_52. ⟨hal-00410590⟩

Share

Metrics

Record views

1003

Files downloads

452