Hybrid Strategy for Selecting Compact Set of Clustering Partitions - HAL open archive
Journal article. Journal of Applied Soft Computing, Springer. Year: 2020

Hybrid Strategy for Selecting Compact Set of Clustering Partitions

Abstract

Selecting the most appropriate clustering algorithm is not straightforward, because no clustering algorithm can determine the actual groups present in every dataset. A potential solution is to use different clustering algorithms to produce a set of partitions (solutions) and then select the best partition according to a specified validation measure; however, these measures are generally biased toward one or more clustering algorithms. Moreover, in several real cases, it is important to have more than one solution as the output. To address these problems, we present a hybrid partition selection algorithm, HSS, which accepts as input a set of base partitions, potentially generated by clustering algorithms with different biases, and aims to return a reduced yet diverse set of partitions (solutions). HSS comprises three steps: (i) the application of a multiobjective algorithm to the set of base partitions to generate a Pareto Front (PF) approximation; (ii) the division of the solutions from the PF approximation into a certain number of regions; and (iii) the selection of one solution per region by applying the Adjusted Rand Index. We compare the results of our algorithm with those of another selection strategy, ASA. Furthermore, we test HSS as a post-processing tool for two clustering algorithms based on multiobjective evolutionary computing: MOCK and MOCLE. The experiments revealed the effectiveness of HSS in selecting a reduced number of partitions while maintaining their quality.
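The three steps listed in the abstract can be illustrated with a minimal Python sketch. The abstract does not state which objectives are optimized, how the PF approximation is divided into regions, or exactly how the Adjusted Rand Index picks a solution, so everything below is an assumption made for illustration: plain non-dominated filtering stands in for the multiobjective algorithm, all objectives are assumed to be minimized, regions are contiguous slices of the sorted front, and each region is represented by the partition with the highest mean ARI to the other partitions in that region. The function names are hypothetical and are not taken from the paper.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(a, b):
    """Adjusted Rand Index between two label sequences of equal length."""
    n = len(a)
    if n < 2:
        return 1.0  # no object pairs to compare
    pairs = sum(comb(c, 2) for c in Counter(zip(a, b)).values())
    rows = sum(comb(c, 2) for c in Counter(a).values())
    cols = sum(comb(c, 2) for c in Counter(b).values())
    expected = rows * cols / comb(n, 2)
    maximum = (rows + cols) / 2
    if maximum == expected:  # degenerate case, e.g. both partitions trivial
        return 1.0
    return (pairs - expected) / (maximum - expected)


def pareto_front(objectives):
    """Indices of non-dominated points, assuming every objective is minimized."""
    def dominates(p, q):
        return all(x <= y for x, y in zip(p, q)) and tuple(p) != tuple(q)
    return [i for i, p in enumerate(objectives)
            if not any(dominates(q, p) for q in objectives)]


def mean_ari_to_region(i, region, partitions):
    """Mean ARI between partition i and the other partitions of its region."""
    others = [j for j in region if j != i]
    if not others:
        return 1.0
    return sum(adjusted_rand_index(partitions[i], partitions[j])
               for j in others) / len(others)


def select_partitions(partitions, objectives, n_regions):
    """Return indices of one representative partition per region of the front."""
    # Step (i), simplified: non-dominated filtering replaces the multiobjective search.
    front = sorted(pareto_front(objectives), key=lambda i: tuple(objectives[i]))
    # Step (ii), assumed rule: cut the sorted front into contiguous regions.
    n_regions = min(n_regions, len(front))
    bounds = [r * len(front) // n_regions for r in range(n_regions + 1)]
    regions = [front[bounds[r]:bounds[r + 1]] for r in range(n_regions)]
    # Step (iii), assumed rule: keep the partition most similar (by ARI) to its region.
    return [max(region, key=lambda i: mean_ari_to_region(i, region, partitions))
            for region in regions]


if __name__ == "__main__":
    # Toy base partitions of six objects with two hypothetical objective values each.
    partitions = [
        [0, 0, 0, 1, 1, 1],
        [0, 0, 1, 1, 1, 1],
        [0, 0, 1, 1, 2, 2],
        [0, 1, 2, 0, 1, 2],
        [0, 1, 0, 1, 0, 1],
    ]
    objectives = [(0.1, 0.9), (0.2, 0.8), (0.7, 0.3), (0.9, 0.1), (0.95, 0.95)]
    print(select_partitions(partitions, objectives, n_regions=2))
```

In this sketch the number of regions directly bounds the number of partitions returned, which mirrors the stated goal of producing a reduced yet diverse set; a different region rule or selection criterion would change which partitions are kept.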
File not deposited

Dates and versions

hal-02388417, version 1 (01-12-2019)

Cite

Vanessa Antunes, Tiemi Sakata, Katti Faceli, Marcilio de Souto. Hybrid Strategy for Selecting Compact Set of Clustering Partitions. Journal of Applied Soft Computing, Springer, 2020, 87, pp.105971. ⟨10.1016/j.asoc.2019.105971⟩. ⟨hal-02388417⟩