Selection of GLM mixtures: a new criterion for clustering purpose

Abstract : Model-based clustering with finite mixtures of generalized linear models (GLMs) is a challenging problem that has seen many recent developments. In practice, model selection is usually performed with the penalized criteria AIC or BIC. However, simulations show that these criteria tend to overestimate the actual dimension of the model. This evidence led us to consider a new criterion close to ICL, first introduced in Baudry (2009). Its definition requires a contrast embedding an entropic term: using concentration inequalities, we derive key properties of the convergence of the associated M-estimator. The consistency of the corresponding classification criterion then follows under classical requirements on the penalty term. Finally, a simulation study corroborates our theoretical results and shows the effectiveness of the method from a clustering perspective.
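To make the contrast between BIC and an ICL-type criterion concrete, here is a minimal sketch in Python. It is not the criterion studied in the paper: the abstract indicates that the entropic term enters the contrast itself, whereas this sketch, as a simplifying assumption, uses the common approximation in which the classification entropy is added to BIC after fitting. The function names, the "lower is better" sign convention, and the numeric inputs are all hypothetical and chosen for illustration only.

```python
import math

def bic(log_lik, n_params, n_obs):
    """BIC in 'lower is better' form: -2 log L + k log n."""
    return -2.0 * log_lik + n_params * math.log(n_obs)

def classification_entropy(tau):
    """Entropy of the posterior membership probabilities tau[i][k]:
    -sum_i sum_k tau_ik * log(tau_ik), with 0 * log(0) taken as 0."""
    return -sum(p * math.log(p) for row in tau for p in row if p > 0.0)

def icl(log_lik, n_params, n_obs, tau):
    """ICL-style criterion (assumed approximation): BIC plus twice the
    classification entropy, so poorly separated components are penalized."""
    return bic(log_lik, n_params, n_obs) + 2.0 * classification_entropy(tau)

# Hypothetical posterior membership matrices for 4 observations, 2 components.
tau_separated = [[0.99, 0.01], [0.02, 0.98], [0.97, 0.03], [0.01, 0.99]]
tau_overlapping = [[0.55, 0.45], [0.50, 0.50], [0.60, 0.40], [0.45, 0.55]]

# Same (made-up) log-likelihood and parameter count for both fits:
# BIC cannot tell them apart, while the entropy term can.
for name, tau in [("separated", tau_separated), ("overlapping", tau_overlapping)]:
    print(name, bic(-10.0, 5, 4), icl(-10.0, 5, 4, tau))
```

With identical likelihood and dimension, the two fits receive the same BIC, but the fit with overlapping components receives a larger (worse) ICL-style value. This is the mechanism by which an entropy-based criterion discourages extra components that do not correspond to well-separated clusters.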
Document type :
Preprints, Working Papers, ...

Cited literature [35 references]

https://hal.archives-ouvertes.fr/hal-00957880
Contributor : Olivier Lopez
Submitted on : Tuesday, March 11, 2014 - 1:53:11 PM
Last modification on : Tuesday, April 2, 2019 - 2:24:47 AM
Document(s) archived on : Wednesday, June 11, 2014 - 11:36:23 AM

File

MixtureSelection_25-02-2014.pd...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00957880, version 1

Citation

Olivier Lopez, Xavier Milhaud. Selection of GLM mixtures: a new criterion for clustering purpose. 2014. ⟨hal-00957880⟩

Metrics

Record views : 558
Files downloads : 497