Post-Retrieval Clustering Using Third-Order Similarity Measures

José G. Moreno; Gaël Dias; Guillaume Cleuziou

Conference Papers Year : 2013

Post-Retrieval Clustering Using Third-Order Similarity Measures

(1) , (1) , (2)

1
2

José G. Moreno

Function : Author
PersonId : 743396
IdHAL : jose-g-moreno
ORCID : 0000-0002-8852-5797
IdRef : 190544007

Equipe Hultech - Laboratoire GREYC - UMR6072

Gaël Dias

Function : Author
PersonId : 3735
IdHAL : gael-dias
ORCID : 0000-0002-5840-1603
IdRef : 113779747

Equipe Hultech - Laboratoire GREYC - UMR6072

Guillaume Cleuziou

Function : Author
PersonId : 834265

Laboratoire d'Informatique Fondamentale d'Orléans

Abstract

Post-retrieval clustering is the task of clustering Web search results. Within this context, we propose a new methodology that adapts the classical K-means algorithm to a third-order similarity measure initially developed for NLP tasks. Results obtained with the definition of a new stopping criterion over the ODP-239 and the MORESQUE golden standard datasets evidence that our proposal outperforms all reported text-based approaches.

Domains

Information Retrieval [cs.IR] Machine Learning [cs.LG]

Fichier principal

ACTI-MORENO-2013-4.pdf (4.68 Mo)

Origin : Files produced by the author(s)

Guillaume Cleuziou : Connect in order to contact the contributor

https://hal.science/hal-00931263

Submitted on : Friday, September 19, 2014-2:40:41 PM

Last modification on : Wednesday, March 20, 2024-4:20:04 PM

Long-term archiving on: Saturday, December 20, 2014-10:31:36 AM

Dates and versions

hal-00931263 , version 1 (19-09-2014)

Identifiers

HAL Id : hal-00931263 , version 1

Cite

José G. Moreno, Gaël Dias, Guillaume Cleuziou. Post-Retrieval Clustering Using Third-Order Similarity Measures. Annual Meeting of the Association for Computational Linguistics (ACL 2013), Aug 2013, Sofia, Bulgaria. pp.153-158. ⟨hal-00931263⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS UNIV-ORLEANS MSL MSL-THESE GREYC GREYC-HULTECH COMUE-NORMANDIE ENSICAEN UNICAEN

190 View

52 Download

Post-Retrieval Clustering Using Third-Order Similarity Measures

Abstract

Domains

Dates and versions

Identifiers

Cite

Export

Collections

Share