A supervised methodology to measure the variables contribution to a clustering

Oumaima Alaoui Ismaili; Vincent Lemaire; Antoine Cornuéjols

Conference paper, Year: 2014


Oumaima Alaoui Ismaili

Role: Author

Mathématiques et Informatique Appliquées

Orange Labs

Vincent Lemaire

Role: Author

Orange Labs

Antoine Cornuéjols

Role: Author
PersonId: 182386
IdHAL: antoine-cornuejols
ORCID: 0000-0002-2979-3521
IdRef: 067132669

Mathématiques et Informatique Appliquées

Abstract

This article proposes a supervised approach for evaluating the contribution of explanatory variables to a clustering. The main idea is to learn to predict each instance's cluster membership from each variable taken individually. The variables are then sorted by predictive power, measured with one of two evaluation criteria: accuracy (ACC) or the Adjusted Rand Index (ARI). Once the relevant variables, those that contribute to discriminating between the clusters, have been identified, the redundant ones are filtered out using a supervised method. The aim of this work is to help end users easily understand a clustering of high-dimensional data. Experimental results show that the proposed method is competitive with existing methods from the literature.
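The ranking step described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the univariate predictor (equal-frequency bins with a majority vote per bin), the function names, the `n_bins` parameter, and the toy data are all hypothetical, and scores are computed on the same data used to fit the predictor rather than on a held-out set. The redundancy-filtering step is not shown because the abstract does not specify it.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_a, labels_b):
    """Adjusted Rand Index between two labelings of the same instances."""
    n = len(labels_a)
    sum_cells = sum(comb(c, 2) for c in Counter(zip(labels_a, labels_b)).values())
    sum_a = sum(comb(c, 2) for c in Counter(labels_a).values())
    sum_b = sum(comb(c, 2) for c in Counter(labels_b).values())
    expected = sum_a * sum_b / comb(n, 2)
    max_index = (sum_a + sum_b) / 2
    if max_index == expected:  # degenerate case, e.g. one cluster on both sides
        return 1.0
    return (sum_cells - expected) / (max_index - expected)


def predict_from_variable(values, clusters, n_bins=4):
    """Predict cluster labels from a single numeric variable.

    Assumption: equal-frequency bins with a majority vote per bin stand in
    for whichever univariate classifier is actually used.
    """
    order = sorted(range(len(values)), key=lambda i: values[i])
    bin_of = [0] * len(values)
    for rank, i in enumerate(order):
        bin_of[i] = rank * n_bins // len(values)
    votes = {}
    for b, c in zip(bin_of, clusters):
        votes.setdefault(b, Counter())[c] += 1
    majority = {b: counts.most_common(1)[0][0] for b, counts in votes.items()}
    return [majority[b] for b in bin_of]


def rank_variables(data, clusters, criterion="ACC", n_bins=4):
    """Sort variables by how well each one alone predicts the cluster labels."""
    scores = {}
    for name, values in data.items():
        predicted = predict_from_variable(values, clusters, n_bins)
        if criterion == "ACC":
            hits = sum(p == c for p, c in zip(predicted, clusters))
            scores[name] = hits / len(clusters)
        else:  # any other value selects the ARI criterion
            scores[name] = adjusted_rand_index(clusters, predicted)
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy data: cluster labels as produced by any clustering algorithm,
# one variable that separates the clusters and one that does not.
clusters = [0, 0, 0, 0, 1, 1, 1, 1]
data = {
    "informative": [0.1, 0.2, 0.3, 0.4, 5.1, 5.2, 5.3, 5.4],
    "noise": [3.0, 7.0, 1.0, 5.0, 2.0, 6.0, 0.0, 4.0],
}
ranking_acc = rank_variables(data, clusters, criterion="ACC")
ranking_ari = rank_variables(data, clusters, criterion="ARI")
print(ranking_acc)
print(ranking_ari)
```

On this toy data both criteria place `informative` ahead of `noise`. In a realistic setting the scores would more plausibly be estimated with a train/test split or cross-validation, so that variables are not rewarded for overfitting the cluster labels.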

Keywords

supervised methodology clustering

Domains

Life Sciences [q-bio]

Contributor: Archive Ouverte ProdInra

https://hal.science/hal-01565541

Submitted on: Wednesday, July 19, 2017, 20:44:41

Last modified on: Tuesday, March 12, 2024, 10:44:02

Dates and versions

hal-01565541, version 1 (19-07-2017)

Identifiers

HAL Id: hal-01565541, version 1
PRODINRA: 399091

Cite

Oumaima Alaoui Ismaili, Vincent Lemaire, Antoine Cornuéjols. A supervised methodology to measure the variables contribution to a clustering. 21st International Conference on Neural Information Processing (ICONIP 2014), Nov 2014, Kuching, Malaysia. ⟨hal-01565541⟩


Collections

AGROPARISTECH INRA MIA-PARIS INRAE MATHNUM

