An Efficient Active Constraint Selection Algorithm for Clustering

Abstract : In this paper, we address the problem of active query selection for clustering with constraints. The objective is to determine automatically a set of queries and their associated must-link and can-not link constraints to help constraints based clustering algorithms to converge. Some works on active constraints learning have already been proposed but they are only applied to K-Means like clustering algorithms which are known to be limited to spherical clusters while we are interested in constraints-based clustering algorithms that deals with clusters of arbitrary shapes and sizes (like Constrained-DBSCAN, Constrained-Hierarchical Clustering. . . ). Our novel approach relies on a k-nearest neighbors graph to estimate the dense regions of the data space and generates queries at the frontier between clusters where the cluster membership is most uncertain. Experiments show that our framework improves the performance of constraints based clustering algorithms.
Document type :
Conference papers
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-01291865
Contributor : Lip6 Publications <>
Submitted on : Tuesday, March 22, 2016 - 11:22:19 AM
Last modification on : Thursday, March 21, 2019 - 1:04:54 PM

Identifiers

Citation

Viet Vu Vu, Nicolas Labroche, Bernadette Bouchon-Meunier. An Efficient Active Constraint Selection Algorithm for Clustering. The 20th International Conference on Pattern Recognition (ICPR-2010), Aug 2010, Istanbul, Turkey. pp.2969-2972, ⟨10.1109/ICPR.2010.727⟩. ⟨hal-01291865⟩

Share

Metrics

Record views

178