Co-clustering Documents and Words by Minimizing the Normalized Cut Objective Function - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Journal of Mathematical Modelling and Algorithms Année : 2010

Co-clustering Documents and Words by Minimizing the Normalized Cut Objective Function

Résumé

This paper follows a word-document co-clustering model independently introduced in 2001 by several authors such as I.S. Dhillon, H. Zha and C. Ding. This model consists in creating a bipartite graph based on word frequencies in documents, and whose vertices are both documents and words. The created bipartite graph is then partitioned in a way that minimizes the normalized cut objective function to produce the document clustering. The fusion-fission graph partitioning metaheuristic is applied on several document collections using this word-document co-clustering model. Results demonstrate a real problem in this model: partitions found almost always have a normalized cut value lowest than the original document collection clustering. Moreover, measures of the goodness of solutions seem to be relatively independent of the normalized cut values of partitions.
Fichier principal
Vignette du fichier
Liris-4669.pdf (222.74 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01381475 , version 1 (06-03-2017)

Identifiants

Citer

Charles-Edmond Bichot. Co-clustering Documents and Words by Minimizing the Normalized Cut Objective Function. Journal of Mathematical Modelling and Algorithms, 2010, 2, 9, pp.131-147. ⟨10.1007/s10852-010-9126-0⟩. ⟨hal-01381475⟩
96 Consultations
401 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More