Skip to Main content Skip to Navigation
Conference papers

New efficient clustering quality indexes

Abstract : This paper deals with a major challenge in clustering that is optimal model selection. It presents new efficient clustering quality indexes relying on feature maximization, which is an alternative measure to usual distributional measures relying on entropy, Chi-square metric or vector-based measures such as Euclidean distance or correlation distance. First Experiments compare the behavior of these new indexes with usual cluster quality indexes based on Euclidean distance on different kinds of test datasets for which ground truth is available. This comparison clearly highlights altogether the superior accuracy and stability of the new method on these datasets, its efficiency from low to high dimensional range and its tolerance to noise. Further experiments are then conducted on " real life " textual data extracted from a multisource bibliographic database for which ground truth is unknown. These experiments show that the accuracy and stability of these new indexes allow to deal efficiently with diachronic analysis, when other indexes do not fit the requirements for this task.
Complete list of metadatas

Cited literature [32 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01350509
Contributor : Nicolas Dugue <>
Submitted on : Saturday, July 30, 2016 - 1:39:44 AM
Last modification on : Wednesday, March 18, 2020 - 2:56:38 PM
Document(s) archivé(s) le : Monday, October 31, 2016 - 10:20:16 AM

File

LamirelIJCNN2016.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01350509, version 1

Collections

Citation

Jean-Charles Lamirel, Nicolas Dugué, Pascal Cuxac. New efficient clustering quality indexes. International Joint Conference on Neural Networks (IJCNN 2016), Jul 2016, Vancouver, Canada. ⟨hal-01350509⟩

Share

Metrics

Record views

473

Files downloads

1295