Skip to Main content Skip to Navigation
Conference papers

New efficient clustering quality indexes

Abstract : This paper deals with a major challenge in clustering that is optimal model selection. It presents new efficient clustering quality indexes relying on feature maximization, which is an alternative measure to usual distributional measures relying on entropy, Chi-square metric or vector-based measures such as Euclidean distance or correlation distance. First Experiments compare the behavior of these new indexes with usual cluster quality indexes based on Euclidean distance on different kinds of test datasets for which ground truth is available. This comparison clearly highlights altogether the superior accuracy and stability of the new method on these datasets, its efficiency from low to high dimensional range and its tolerance to noise. Further experiments are then conducted on " real life " textual data extracted from a multisource bibliographic database for which ground truth is unknown. These experiments show that the accuracy and stability of these new indexes allow to deal efficiently with diachronic analysis, when other indexes do not fit the requirements for this task.
Complete list of metadata

Cited literature [32 references]  Display  Hide  Download
Contributor : Nicolas Dugue Connect in order to contact the contributor
Submitted on : Saturday, July 30, 2016 - 1:39:44 AM
Last modification on : Monday, September 12, 2022 - 2:08:27 PM
Long-term archiving on: : Monday, October 31, 2016 - 10:20:16 AM


Files produced by the author(s)


  • HAL Id : hal-01350509, version 1



Jean-Charles Lamirel, Nicolas Dugué, Pascal Cuxac. New efficient clustering quality indexes. International Joint Conference on Neural Networks (IJCNN 2016), Jul 2016, Vancouver, Canada. ⟨hal-01350509⟩



Record views


Files downloads