Active Learning for Semi-Supervised K-Means Clustering

Viet Vu Vu; Nicolas Labroche; Bernadette Bouchon-Meunier

doi:10.1109/ICTAI.2010.11

Communication Dans Un Congrès Année : 2010

Active Learning for Semi-Supervised K-Means Clustering

(1) , (1) , (1)

Viet Vu Vu

Fonction : Auteur
PersonId : 980044

Machine Learning and Information Retrieval

Nicolas Labroche

Fonction : Auteur
PersonId : 4509
IdHAL : nicolas-labroche
ORCID : 0000-0002-2794-2124
IdRef : 132080303

Machine Learning and Information Retrieval

Bernadette Bouchon-Meunier

Fonction : Auteur
PersonId : 9708
IdHAL : bernadette-bouchon-meunier
ORCID : 0000-0002-7937-7796
IdRef : 031064442

Machine Learning and Information Retrieval

Résumé

K-Means algorithm is one of the most used clustering algorithm for Knowledge Discovery in Data Mining. Seed based K-Means is the integration of a small set of labeled data (called seeds) to the K-Means algorithm to improve its performances and overcome its sensitivity to initial centers. These centers are, most of the time, generated at random or they are assumed to be available for each cluster. This paper introduces a new efficient algorithm for active seeds selection which relies on a Min-Max approach that favors the coverage of the whole dataset. Experiments conducted on artificial and real datasets show that, using our active seeds selection algorithm, each cluster contains at least one seed after a very small number of queries and thus helps reducing the number of iterations until convergence which is crucial in many KDD applications.

Domaines

Informatique [cs]

Lip6 Publications : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01292094

Soumis le : mardi 22 mars 2016-15:23:42

Dernière modification le : mardi 11 avril 2023-15:16:28

Dates et versions

hal-01292094 , version 1 (22-03-2016)

Identifiants

HAL Id : hal-01292094 , version 1
DOI : 10.1109/ICTAI.2010.11

Citer

Viet Vu Vu, Nicolas Labroche, Bernadette Bouchon-Meunier. Active Learning for Semi-Supervised K-Means Clustering. The 22th IEEE International Conference on Tools with Artificial Intelligence (ICTAI-2010), Oct 2010, Arras, France. pp.12-15, ⟨10.1109/ICTAI.2010.11⟩. ⟨hal-01292094⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UPMC CNRS LIP6 SORBONNE-UNIVERSITE SU-SCIENCES

101 Consultations

0 Téléchargements

Active Learning for Semi-Supervised K-Means Clustering

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager