DiSeg 1.0: The first system for Spanish discourse segmentation - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Expert Systems with Applications Année : 2012

DiSeg 1.0: The first system for Spanish discourse segmentation

Résumé

Nowadays discourse parsing is a very prominent research topic. However, there is not a discourse parser for Spanish texts. The first stage in order to develop this tool is discourse segmentation. In this work, we present DiSeg, the first discourse segmenter for Spanish, which uses the framework of Rhetorical Structure Theory and is based on lexical and syntactic rules. We describe the system and we evaluate its performance against a gold standard corpus, divided in a medical and a terminological subcorpus. We obtain promising results, which means that discourse segmentation is possible using shallow parsing.
Fichier non déposé

Dates et versions

hal-01314824 , version 1 (12-05-2016)

Identifiants

Citer

Iria da Cunha, Eric San, Juan-Manuel Torres-Moreno, Marina Lloberese, Irene Castellóne. DiSeg 1.0: The first system for Spanish discourse segmentation. Expert Systems with Applications, 2012, ⟨10.1016/j.eswa.2011.06.058⟩. ⟨hal-01314824⟩

Collections

UNIV-AVIGNON LIA
68 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More