Conference paper, Year: 2008

Constrained co-clustering of gene expression data

Abstract

In many applications, experts find co-clustering easier to interpret than mono-dimensional clustering. Co-clustering computes a bi-partition, that is, a collection of co-clusters: each co-cluster is a group of objects associated with a group of attributes, and these associations can support interpretation. Many constrained clustering algorithms have been proposed to exploit domain knowledge and to improve partition relevance in the mono-dimensional case (e.g., using the so-called must-link and cannot-link constraints). Here, we consider constrained co-clustering not only for extended must-link and cannot-link constraints (i.e., both objects and attributes can be involved), but also for interval constraints that enforce properties of co-clusters when considering ordered domains. We propose an iterative co-clustering algorithm that exploits user-defined constraints while minimizing the sum-squared residues, an objective function introduced for gene expression data clustering by Cho et al. (2004). We illustrate the added value of our approach in two applications that concern gene expression data analysis.
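As a rough illustration of the two ingredients named in the abstract, the sketch below evaluates a sum-squared residue objective for a given bi-partition and checks must-link and cannot-link constraints on a labeling. It is not the authors' algorithm: the residue form h_ij = a_ij - a_iJ - a_Ij + a_IJ is assumed here and may differ from the exact variant used in the paper, interval constraints are not covered, and the function names `sum_squared_residue` and `violated_constraints` are hypothetical.

```python
from itertools import product


def sum_squared_residue(matrix, row_labels, col_labels):
    """Sum of squared residues over all co-clusters of a bi-partition.

    Assumed residue: h_ij = a_ij - a_iJ - a_Ij + a_IJ, where a_iJ and a_Ij
    are the row and column means restricted to the co-cluster (I, J) and
    a_IJ is the mean of the whole co-cluster.
    """
    total = 0.0
    for r, c in product(set(row_labels), set(col_labels)):
        rows = [i for i, lab in enumerate(row_labels) if lab == r]
        cols = [j for j, lab in enumerate(col_labels) if lab == c]
        block_mean = sum(matrix[i][j] for i in rows for j in cols) / (len(rows) * len(cols))
        row_means = {i: sum(matrix[i][j] for j in cols) / len(cols) for i in rows}
        col_means = {j: sum(matrix[i][j] for i in rows) / len(rows) for j in cols}
        for i in rows:
            for j in cols:
                h = matrix[i][j] - row_means[i] - col_means[j] + block_mean
                total += h * h
    return total


def violated_constraints(labels, must_link, cannot_link):
    """List the pairwise constraints that a labeling does not satisfy.

    The same check applies to object (row) labels and attribute (column)
    labels, since both dimensions can carry constraints.
    """
    bad = [("must", a, b) for a, b in must_link if labels[a] != labels[b]]
    bad += [("cannot", a, b) for a, b in cannot_link if labels[a] == labels[b]]
    return bad


# Toy expression matrix (made-up values) with an additive pattern per block.
A = [
    [1, 2, 8, 9],
    [2, 3, 9, 10],
    [7, 7, 1, 1],
    [8, 8, 2, 2],
]
good_rows, bad_rows, cols = [0, 0, 1, 1], [0, 1, 0, 1], [0, 0, 1, 1]

print(sum_squared_residue(A, good_rows, cols))
print(sum_squared_residue(A, bad_rows, cols))
print(violated_constraints(bad_rows, must_link=[(0, 1)], cannot_link=[(0, 2)]))
```

An iterative constrained scheme would, under these assumptions, reassign rows and columns so that the objective decreases while `violated_constraints` stays empty for both dimensions.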
Main file
Liris-3330.pdf (373.02 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01500611, version 1 (19-11-2022)

License

Attribution

Cite

Ruggero Pensa, Jean-François Boulicaut. Constrained co-clustering of gene expression data. SIAM International Conference on Data Mining SDM'08, Apr 2008, Atlanta, United States. pp.25-36, ⟨10.1137/1.9781611972788.3⟩. ⟨hal-01500611⟩