A bi-clustering framework for categorical data

Céline Robardet; Ruggero G Pensa; Jean-François Boulicaut

doi:10.1007/11564126_68

Communication Dans Un Congrès Année : 2005

A bi-clustering framework for categorical data

(1) , (1) , (1)

Céline Robardet

Fonction : Auteur
PersonId : 3355
IdHAL : celine-robardet
ORCID : 0000-0002-8583-9408
IdRef : 070207054

Data Mining and Machine Learning

Ruggero G Pensa

Fonction : Auteur

Data Mining and Machine Learning

Jean-François Boulicaut

Fonction : Auteur
PersonId : 7269
IdHAL : jfboulicaut
IdRef : 072329130

Data Mining and Machine Learning

Résumé

Bi-clustering is a promising conceptual clustering approach. Within categorical data, it provides a collection of (possibly overlapping) bi-clusters, i.e., linked clusters for both objects and attribute-value pairs. We propose a generic framework for bi-clustering which enables to compute a bi-partition from collections of local patterns which capture locally strong associations between objects and properties. To validate this framework, we have studied in details the instance CDK-Means. It is a K-Means-like clustering on collections of formal concepts, i.e., connected closed sets on both dimensions. It enables to build bi-partitions with a user control on overlapping between bi-clusters. We provide an experimental validation on many benchmark datasets and discuss the interestingness of the computed bi-partitions.

Domaines

Intelligence artificielle [cs.AI] Base de données [cs.DB] Algorithme et structure de données [cs.DS] Apprentissage [cs.LG]

Céline Robardet : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01535565

Soumis le : vendredi 9 juin 2017-10:29:21

Dernière modification le : mercredi 5 juillet 2023-15:28:04

Dates et versions

hal-01535565 , version 1 (09-06-2017)

Identifiants

HAL Id : hal-01535565 , version 1
DOI : 10.1007/11564126_68

Citer

Céline Robardet, Ruggero G Pensa, Jean-François Boulicaut. A bi-clustering framework for categorical data. 9th European Conf. on Principles and Practice of Knowledge Discovery in Databases, PKDD'05, Sep 2005, Porto, Portugal. pp.643-650, ⟨10.1007/11564126_68⟩. ⟨hal-01535565⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS UNIV-LYON1 UNIV-LYON2 INSA-LYON EC-LYON LIRIS LABEXIMU INSA-GROUPE UDL

93 Consultations

0 Téléchargements

A bi-clustering framework for categorical data

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager