Model-based clustering of high-dimensional data in Astrophysics - Archive ouverte HAL Accéder directement au contenu
Chapitre D'ouvrage Année : 2016

Model-based clustering of high-dimensional data in Astrophysics

Résumé

The nature of data in Astrophysics has changed, as in other scientific fields, in the past decades due to the increase of the measurement capabilities. As a consequence, data are nowadays frequently of high dimensionality and available in mass or stream. Model-based techniques for clustering are popular tools which are renowned for their probabilistic foundations and their flexibility. However, classical model-based techniques show a disappointing behavior in high-dimensional spaces which is mainly due to their dramatical over-parametrization. The recent developments in model-based classification overcome these drawbacks and allow to efficiently classify high-dimensional data, even in the " small n / large p " situation. This work presents a comprehensive review of these recent approaches, including regularization-based techniques, parsimonious modeling, subspace classification methods and classification methods based on variable selection. The use of these model-based methods is also illustrated on real-world classification problems in Astrophysics using R packages.
Fichier principal
Vignette du fichier
chapitreAstro.pdf (977.3 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01264844 , version 1 (29-01-2016)
hal-01264844 , version 2 (09-02-2016)

Licence

Paternité

Identifiants

  • HAL Id : hal-01264844 , version 2

Citer

Charles Bouveyron. Model-based clustering of high-dimensional data in Astrophysics. Statistics for Astrophysics: Clustering and Classification, EAS Publications Series, 77, EDP Sciences, pp.91-119, 2016. ⟨hal-01264844v2⟩
254 Consultations
553 Téléchargements

Partager

Gmail Facebook X LinkedIn More