Journal articles

SIMTEX: An Approach for Detecting and Measuring Textual Similarity based on Discourse and Semantics

Abstract : Automatic systems for detecting and measuring textual similarity are currently being developed for application to different tasks in the field of Natural Language Processing (NLP). Most of these systems use surface linguistic features or statistical information; few researchers use deep linguistic information. In this work, we present an algorithm for detecting and measuring textual similarity that takes into account the information offered by the discourse relations of Rhetorical Structure Theory (RST) and the lexical-semantic relations included in EuroWordNet. We apply the algorithm, called SIMTEX, to texts written in Spanish, but the methodology is potentially language-independent.

https://hal.archives-ouvertes.fr/hal-02550811
Contributor : Juan-Manuel Torres-Moreno
Submitted on : Monday, May 4, 2020 - 5:32:31 PM
Last modification on : Thursday, May 7, 2020 - 2:21:47 PM


Citation

Iria da Cunha, Jorge Vivaldi, Juan-Manuel Torres-Moreno, Gerardo Eugenio Sierra-Martinez. SIMTEX: An Approach for Detecting and Measuring Textual Similarity based on Discourse and Semantics. Computación y sistemas, Instituto Politécnico Nacional IPN Centro de Investigación en Computación, 2014, 18 (3), ⟨10.13053/CyS-18-3-2033⟩. ⟨hal-02550811⟩
