Distributional Thesauri for Information Retrieval and vice versa

Vincent Claveau 1 Ewa Kijak 1
1 LinkMedia - Creating and exploiting explicit links between multimedia fragments
Inria Rennes – Bretagne Atlantique, IRISA-D6 - MEDIA ET INTERACTIONS
Abstract: Distributional thesauri are useful in many Natural Language Processing tasks. In this paper, we address the problem of building and evaluating such thesauri with the help of Information Retrieval (IR) concepts. We propose two main contributions. First, following the work of (Claveau et al., 2014), we show how IR tools and concepts can be used successfully to build a thesaurus. Through several experiments, and by directly evaluating the results against reference lexicons, we show that some IR models outperform state-of-the-art systems. Second, we use IR as an applicative framework to indirectly evaluate the generated thesaurus. Here again, this task-based evaluation validates the IR approach used to build the thesaurus. Moreover, it allows us to compare these results with those from the direct evaluation framework used in the literature. The observed differences bring these evaluation habits into question.
Document type:
Conference paper
Language and Resource Conference, LREC, May 2016, Portoroz, Slovenia. Proceedings of Language and Resource Conference, LREC

https://hal.archives-ouvertes.fr/hal-01394770
Contributor: Vincent Claveau
Submitted on: Wednesday, 9 November 2016 - 17:12:15
Last modified on: Friday, 17 February 2017 - 16:11:02
Document(s) archived on: Wednesday, 15 March 2017 - 04:04:24

File

Claveau_LREC16.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: hal-01394770, version 1

Citation

Vincent Claveau, Ewa Kijak. Distributional Thesauri for Information Retrieval and vice versa. Language and Resource Conference, LREC, May 2016, Portoroz, Slovenia. Proceedings of Language and Resource Conference, LREC. <hal-01394770>
