Distributional thesauri for information retrieval and vice versa (Thésaurus distributionnels pour la recherche d'information et vice-versa)

Vincent Claveau 1 Ewa Kijak 1
1 LinkMedia - Creating and exploiting explicit links between multimedia fragments
Inria Rennes – Bretagne Atlantique, IRISA-D6 - MEDIA ET INTERACTIONS
Abstract: Distributional thesauri are useful in many Natural Language Processing tasks. In this paper, we address the problem of building and evaluating such thesauri with the help of Information Retrieval (IR) concepts. We propose two main contributions. First, continuing the work of Claveau et al. (2014), we show how IR tools and concepts can be used successfully to build thesauri. Through several experiments, and by directly evaluating the results against reference lexicons, we show that some IR models outperform state-of-the-art systems. Second, we use IR as an application framework to evaluate the generated thesauri indirectly. Here again, this task-based evaluation validates the IR approach used to build the thesauri. Moreover, it allows us to compare these results with those from the direct evaluation framework used in the literature. The observed differences call these evaluation habits into question.

Cited literature: 47 references

https://hal.archives-ouvertes.fr/hal-01226551
Contributor: Vincent Claveau
Submitted on: Friday, November 27, 2015 - 1:41:15 PM
Last modification on: Thursday, February 7, 2019 - 5:17:35 PM
Archived on: Friday, April 28, 2017 - 5:29:47 AM

File

Claveau_Kijak_DN2015.pdf
Files produced by the author(s)

Citation

Vincent Claveau, Ewa Kijak. Thésaurus distributionnels pour la recherche d'information et vice-versa. Revue des Sciences et Technologies de l'Information - Série Document Numérique, Lavoisier, 2015, 18 (2-3), ⟨10.3166/DN.18.2-3.101-121⟩. ⟨hal-01226551⟩

Metrics

Record views: 293
File downloads: 268