Skip to Main content Skip to Navigation
Journal articles

Evolutionary Subspace Clustering Using Variable Genome Length

Sergio Peignier 1 Christophe Rigotti 2, 3, 4 Guillaume Beslon 2
2 BEAGLE - Artificial Evolution and Computational Biology
LBBE - Laboratoire de Biométrie et Biologie Evolutive - UMR 5558, Inria Grenoble - Rhône-Alpes, LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
3 DM2L - Data Mining and Machine Learning
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract : Subspace clustering is a data-mining task that groups similar data objects and at the same time searches the subspaces where similarities appear. For this reason, subspace clustering is recognized as more general and complicated than standard clustering. In this article, we present ChameleoClust+, a bioinspired evolutionary subspace clustering algorithm that takes advantage of an evolvable genome structure to detect various numbers of clusters located in different subspaces. ChameleoClust+ incorporates several biolike features such as a variable genome length, both functional and nonfunctional elements, and mutation operators including large rearrangements. It was assessed and compared with the state-of-the-art methods on a reference benchmark using both real-world and synthetic data sets. Although other algorithms may need complex parameter settings, ChameleoClust+ needs to set only one subspace clustering ad hoc and intuitive parameter: the maximal number of clusters. The remaining parameters of ChameleoClust+ are related to the evolution strategy (eg, population size, mutation rate), and a single setting for all of them turned out to be effective for all the benchmark data sets. A sensitivity analysis has also been carried out to study the impact of each parameter on the subspace clustering quality.
Document type :
Journal articles
Complete list of metadata
Contributor : Christophe Rigotti Connect in order to contact the contributor
Submitted on : Wednesday, December 11, 2019 - 5:41:33 PM
Last modification on : Tuesday, July 20, 2021 - 5:20:04 PM
Long-term archiving on: : Thursday, March 12, 2020 - 11:08:15 PM


Files produced by the author(s)


Distributed under a Creative Commons Attribution 4.0 International License



Sergio Peignier, Christophe Rigotti, Guillaume Beslon. Evolutionary Subspace Clustering Using Variable Genome Length. Computational Intelligence, Wiley, 2020, 36 (2), pp.574-612. ⟨10.1111/coin.12254⟩. ⟨hal-02405598⟩



Les métriques sont temporairement indisponibles