Model selection by resampling penalization

Sylvain Arlot

doi:10.1214/08-EJS196

Journal article, Electronic Journal of Statistics, Year: 2009

Sylvain Arlot (1, 2)

Role: Author
PersonId: 1608
IdHAL: sylvain-arlot
IdRef: 124609589

1. Laboratoire d'informatique de l'école normale supérieure
2. Models of visual object recognition and scene understanding

Abstract

In this paper, a new family of resampling-based penalization procedures for model selection is defined in a general framework. It generalizes several methods, including Efron's bootstrap penalization and the leave-one-out penalization recently proposed by Arlot (2008), to any exchangeable weighted bootstrap resampling scheme. In the heteroscedastic regression framework, assuming the models to have a particular structure, these resampling penalties are proved to satisfy a non-asymptotic oracle inequality with leading constant close to 1. In particular, they are asymptotically optimal. Resampling penalties are used for defining an estimator adapting simultaneously to the smoothness of the regression function and to the heteroscedasticity of the noise. This is remarkable because resampling penalties are general-purpose devices, which have not been built specifically to handle heteroscedastic data. Hence, resampling penalties naturally adapt to heteroscedasticity. A simulation study shows that resampling penalties improve on V-fold cross-validation in terms of final prediction error, in particular when the signal-to-noise ratio is not large.
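To make the idea in the abstract concrete, here is a minimal Python sketch of a resampling penalty for regressogram (histogram) selection. It is an illustration under stated assumptions, not the paper's reference implementation. The penalty is assumed to take the form const * E_W[P_n γ(ŝ_m^W) − P_n^W γ(ŝ_m^W)], where ŝ_m^W is the least-squares fit on the weighted sample, and the expectation over the weights W is approximated by Monte Carlo with Efron bootstrap (multinomial) weights. The function names, the default `const=1.0`, the restriction of the covariate to [0, 1], and the handling of bins that receive zero resampled weight (they are skipped for that resample) are all choices made for this sketch.

```python
import numpy as np


def bin_index(x, n_bins):
    """Index of the regular bin of [0, 1] that contains each point of x."""
    return np.minimum((np.asarray(x) * n_bins).astype(int), n_bins - 1)


def fit_regressogram(bins, y, n_bins, weights):
    """Weighted least-squares piecewise-constant fit: one weighted mean per bin."""
    w_bin = np.bincount(bins, weights=weights, minlength=n_bins)
    wy_bin = np.bincount(bins, weights=weights * y, minlength=n_bins)
    # Bins with zero total weight keep the placeholder value 0.
    values = np.divide(wy_bin, w_bin, out=np.zeros(n_bins), where=w_bin > 0)
    return values, w_bin


def resampling_penalty(x, y, n_bins, n_resamples=200, const=1.0, seed=None):
    """Monte Carlo estimate of the assumed resampling penalty for one model."""
    rng = np.random.default_rng(seed)
    y = np.asarray(y, dtype=float)
    n = len(y)
    bins = bin_index(x, n_bins)
    gap_sum = np.zeros(n_bins)
    n_valid = np.zeros(n_bins)
    for _ in range(n_resamples):
        # Efron bootstrap weights: multinomial counts that sum to n.
        w = rng.multinomial(n, np.full(n, 1.0 / n)).astype(float)
        values, w_bin = fit_regressogram(bins, y, n_bins, w)
        sq_err = (y - values[bins]) ** 2
        # Per-bin gap between the empirical and the resampled empirical risk.
        gap = np.bincount(bins, weights=(1.0 - w) * sq_err, minlength=n_bins)
        valid = w_bin > 0  # sketch convention: skip bins emptied by resampling
        gap_sum[valid] += gap[valid]
        n_valid[valid] += 1
    per_bin = np.divide(gap_sum, n_valid, out=np.zeros(n_bins), where=n_valid > 0)
    return const * float(per_bin.sum()) / n


def select_model(x, y, candidates, **penalty_kwargs):
    """Pick the number of bins minimizing empirical risk + resampling penalty."""
    y = np.asarray(y, dtype=float)
    crit = {}
    for n_bins in candidates:
        bins = bin_index(x, n_bins)
        values, _ = fit_regressogram(bins, y, n_bins, np.ones(len(y)))
        emp_risk = float(np.mean((y - values[bins]) ** 2))
        crit[n_bins] = emp_risk + resampling_penalty(x, y, n_bins, **penalty_kwargs)
    return min(crit, key=crit.get), crit


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.uniform(size=200)
    # Heteroscedastic toy data: the noise level grows with x.
    y = np.sin(2 * np.pi * x) + (0.2 + x) * rng.normal(size=200)
    best, crit = select_model(x, y, [1, 2, 5, 10, 20], n_resamples=200, seed=1)
    print("selected number of bins:", best)
```

Because the penalty is computed from the data through the resampled fits rather than from a known noise level, nothing in the sketch needs to model the variance of the noise explicitly. The constant `const` is only a placeholder here: the appropriate normalization depends on the resampling scheme and should be taken from the paper.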

Keywords

non-parametric statistics; resampling; exchangeable weighted bootstrap; model selection; penalization; non-parametric regression; adaptivity; heteroscedastic data; regressogram; histogram selection

Domains

Statistics [math.ST]; Theory [stat.TH]

Main file

RP.pdf (884.99 KB)

RP_appendix.pdf (350.77 KB)

Origin: Files produced by the author(s)

Format: Other

https://hal.science/hal-00262478

Submitted on: Wednesday 17 June 2009, 11:28:29

Last modified on: Monday 8 April 2024, 15:49:26

Long-term archiving on: Wednesday 22 September 2010, 12:36:54

Dates and versions

hal-00262478, version 1 (11-03-2008)

hal-00262478, version 2 (17-06-2009)

Identifiers

HAL Id: hal-00262478, version 2
arXiv: 0906.3124
DOI: 10.1214/08-EJS196

Cite

Sylvain Arlot. Model selection by resampling penalization. Electronic Journal of Statistics, 2009, 3, pp. 557-624. ⟨10.1214/08-EJS196⟩. ⟨hal-00262478v2⟩

Collections

ENS-PARIS CNRS INRIA INRIA2 PSL

786 views

359 downloads
