Post-Retrieval Clustering Using Third-Order Similarity Measures

José G. Moreno; Gaël Dias; Guillaume Cleuziou

Communication Dans Un Congrès Année : 2013

Post-Retrieval Clustering Using Third-Order Similarity Measures

(1) , (1) , (2)

1
2

José G. Moreno

Fonction : Auteur
PersonId : 743396
IdHAL : jose-g-moreno
ORCID : 0000-0002-8852-5797
IdRef : 190544007

Equipe Hultech - Laboratoire GREYC - UMR6072

Gaël Dias

Fonction : Auteur
PersonId : 3735
IdHAL : gael-dias
ORCID : 0000-0002-5840-1603
IdRef : 113779747

Equipe Hultech - Laboratoire GREYC - UMR6072

Guillaume Cleuziou

Fonction : Auteur
PersonId : 834265

Laboratoire d'Informatique Fondamentale d'Orléans

Résumé

Post-retrieval clustering is the task of clustering Web search results. Within this context, we propose a new methodology that adapts the classical K-means algorithm to a third-order similarity measure initially developed for NLP tasks. Results obtained with the definition of a new stopping criterion over the ODP-239 and the MORESQUE golden standard datasets evidence that our proposal outperforms all reported text-based approaches.

Domaines

Recherche d'information [cs.IR] Apprentissage [cs.LG]

Fichier principal

ACTI-MORENO-2013-4.pdf (4.68 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Guillaume Cleuziou : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00931263

Soumis le : vendredi 19 septembre 2014-14:40:41

Dernière modification le : mercredi 20 mars 2024-16:20:04

Archivage à long terme le : samedi 20 décembre 2014-10:31:36

Dates et versions

hal-00931263 , version 1 (19-09-2014)

Identifiants

HAL Id : hal-00931263 , version 1

Citer

José G. Moreno, Gaël Dias, Guillaume Cleuziou. Post-Retrieval Clustering Using Third-Order Similarity Measures. Annual Meeting of the Association for Computational Linguistics (ACL 2013), Aug 2013, Sofia, Bulgaria. pp.153-158. ⟨hal-00931263⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS UNIV-ORLEANS MSL MSL-THESE GREYC GREYC-HULTECH COMUE-NORMANDIE ENSICAEN UNICAEN

190 Consultations

52 Téléchargements

Post-Retrieval Clustering Using Third-Order Similarity Measures

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager