Conference papers

A bi-clustering framework for categorical data

Céline Robardet 1, Ruggero G Pensa 1, Jean-François Boulicaut 1
1 DM2L - Data Mining and Machine Learning
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract: Bi-clustering is a promising conceptual clustering approach. On categorical data, it provides a collection of (possibly overlapping) bi-clusters, i.e., linked clusters of both objects and attribute-value pairs. We propose a generic bi-clustering framework that computes a bi-partition from collections of local patterns capturing locally strong associations between objects and properties. To validate this framework, we have studied in detail one instance, CDK-Means: a K-Means-like clustering on collections of formal concepts, i.e., connected closed sets on both dimensions. It builds bi-partitions while giving the user control over the overlap between bi-clusters. We provide an experimental validation on many benchmark datasets and discuss the interestingness of the computed bi-partitions.
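As an illustration of the local patterns mentioned in the abstract, the sketch below enumerates the formal concepts of a tiny 0/1 matrix by brute force. It is not the authors' implementation and does not reproduce CDK-Means. The function name `formal_concepts` and the toy matrix are assumptions made for this example, in which rows stand for objects and columns for attribute-value pairs. A formal concept is a pair (set of objects, set of columns) in which each set is exactly what the other has in common.

```python
from itertools import combinations


def formal_concepts(matrix):
    """Enumerate all formal concepts of a small 0/1 matrix by brute force.

    Hypothetical helper for illustration only: it tries every subset of
    objects, so it is exponential and suitable for toy inputs only.
    """
    n_objects = len(matrix)
    n_attrs = len(matrix[0]) if matrix else 0

    def common_attrs(objs):
        # Columns that are set to 1 for every object in objs.
        return frozenset(
            a for a in range(n_attrs) if all(matrix[o][a] for o in objs)
        )

    def common_objs(attrs):
        # Objects that have a 1 in every column of attrs.
        return frozenset(
            o for o in range(n_objects) if all(matrix[o][a] for a in attrs)
        )

    concepts = set()
    for size in range(n_objects + 1):
        for objs in combinations(range(n_objects), size):
            intent = common_attrs(objs)    # closed set of columns
            extent = common_objs(intent)   # closed set of objects
            concepts.add((extent, intent))
    return concepts


# Toy data (assumed): 3 objects described by 3 attribute-value pairs.
toy = [
    [1, 1, 0],
    [1, 1, 0],
    [0, 1, 1],
]
concepts = formal_concepts(toy)
for extent, intent in sorted(concepts, key=lambda c: (len(c[0]), sorted(c[0]))):
    print(sorted(extent), sorted(intent))
```

On this toy matrix, objects 0 and 1 together with columns 0 and 1 form one concept. A K-Means-like procedure such as the one described in the abstract would then take a collection of such pairs as its input.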

https://hal.archives-ouvertes.fr/hal-01535565
Contributor: Céline Robardet
Submitted on: Friday, June 9, 2017 - 10:29:21 AM
Last modification on: Thursday, November 21, 2019 - 1:44:06 AM

Citation

Céline Robardet, Ruggero G Pensa, Jean-François Boulicaut. A bi-clustering framework for categorical data. 9th European Conf. on Principles and Practice of Knowledge Discovery in Databases, PKDD'05, Sep 2005, Porto, Portugal. pp.643-650, ⟨10.1007/11564126_68⟩. ⟨hal-01535565⟩
