Selection of GLM mixtures: a new criterion for clustering purpose

Abstract : Model-based clustering from finite mixtures of generalized linear models is a challenging issue which has undergone many recent developments. In practice, the model selection step is usually performed by using AIC or BIC penalized criteria. Though, simulations show that they tend to overestimate the actual dimension of the model. These evidence led us to consider a new criterion close to ICL, firstly introduced in Baudry (2009). Its definition requires to introduce a contrast embedding an entropic term: using concentration inequalities, we derive key properties about the convergence of the associated M-estimator. The consistency of the corresponding classification criterion then follows depending on some classical requirements on the penalty term. Finally a simulation study enables to corroborate our theoretical results, and shows the effectiveness of the method in a clustering perspective.
Type de document :
Pré-publication, Document de travail
2014
Liste complète des métadonnées


https://hal.archives-ouvertes.fr/hal-00957880
Contributeur : Olivier Lopez <>
Soumis le : mardi 11 mars 2014 - 13:53:11
Dernière modification le : mardi 30 mai 2017 - 01:17:30
Document(s) archivé(s) le : mercredi 11 juin 2014 - 11:36:23

Fichier

MixtureSelection_25-02-2014.pd...
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00957880, version 1

Collections

Citation

Olivier Lopez, Milhaud Xavier. Selection of GLM mixtures: a new criterion for clustering purpose. 2014. <hal-00957880>

Partager

Métriques

Consultations de
la notice

403

Téléchargements du document

267