Post-Retrieval Clustering Using Third-Order Similarity Measures - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2013

Post-Retrieval Clustering Using Third-Order Similarity Measures

Résumé

Post-retrieval clustering is the task of clustering Web search results. Within this context, we propose a new methodology that adapts the classical K-means algorithm to a third-order similarity measure initially developed for NLP tasks. Results obtained with the definition of a new stopping criterion over the ODP-239 and the MORESQUE golden standard datasets evidence that our proposal outperforms all reported text-based approaches.
Fichier principal
Vignette du fichier
ACTI-MORENO-2013-4.pdf (4.68 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-00931263 , version 1 (19-09-2014)

Identifiants

  • HAL Id : hal-00931263 , version 1

Citer

José G. Moreno, Gaël Dias, Guillaume Cleuziou. Post-Retrieval Clustering Using Third-Order Similarity Measures. Annual Meeting of the Association for Computational Linguistics (ACL 2013), Aug 2013, Sofia, Bulgaria. pp.153-158. ⟨hal-00931263⟩
190 Consultations
52 Téléchargements

Partager

Gmail Facebook X LinkedIn More