Sparse Oracle Inequalities for Variable Selection via Regularized Quantization

Clément Levrard

doi:10.3150/16-BEJ876

Article Dans Une Revue Bernoulli Année : 2018

Sparse Oracle Inequalities for Variable Selection via Regularized Quantization

(1, 2)

1
2

Clément Levrard

Fonction : Auteur
PersonId : 919369

Laboratoire de Probabilités et Modèles Aléatoires

Laboratoire de Probabilités, Statistique et Modélisation

Résumé

We give oracle inequalities on procedures which combines quantization and variable selection via a weighted Lasso $k$-means type algorithm. The results are derived for a general family of weights, which can be tuned to size the influence of the variables in different ways. Moreover, these theoretical guarantees are proved to adapt the corresponding sparsity of the optimal codebooks, if appropriate. Even if there is no sparsity assumption on the optimal codebooks, our procedure is proved to be close to a sparse approximation of the optimal codebooks, as has been done for the Generalized Linear Models in regression. If the optimal codebooks have a sparse support, we also show that this support can be asymptotically recovered, giving an asymptotic upper bound on the probability of misclassification. These results are illustrated with Gaussian mixture models in arbitrary dimension with sparsity assumptions on the means, which are standard distributions in model-based clustering.

Mots clés

Lasso k means Quantization Sparsity Oracle Inequality Variable selection

Domaines

Statistiques [math.ST] Théorie [stat.TH]

Fichier principal

Sparseoracleinequalitiesforfeatureselectionviaregularizedquantizationv2.pdf (171.72 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Clément Levrard : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01005545

Soumis le : mercredi 6 juillet 2016-11:28:35

Dernière modification le : jeudi 14 mars 2024-03:11:29

Archivage à long terme le : vendredi 7 octobre 2016-10:28:30

Dates et versions

hal-01005545 , version 1 (12-06-2014)

hal-01005545 , version 2 (15-04-2015)

hal-01005545 , version 3 (06-07-2016)

Identifiants

HAL Id : hal-01005545 , version 3
ARXIV : 1406.3334
DOI : 10.3150/16-BEJ876

Citer

Clément Levrard. Sparse Oracle Inequalities for Variable Selection via Regularized Quantization. Bernoulli, 2018, 24 (1), ⟨10.3150/16-BEJ876⟩. ⟨hal-01005545v3⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-PARIS7 PMA CNRS USPC LPSM SORBONNE-UNIVERSITE SU-SCIENCES UP-SCIENCES ANR

381 Consultations

377 Téléchargements

Sparse Oracle Inequalities for Variable Selection via Regularized Quantization

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager