Model selection by resampling penalization

Sylvain Arlot

doi:10.1214/08-EJS196

Journal article, Electronic Journal of Statistics, 2009

Sylvain Arlot (1, 2)
IdHAL : sylvain-arlot
IdRef : 124609589

1 Laboratoire d'informatique de l'école normale supérieure
2 Models of visual object recognition and scene understanding

Abstract

In this paper, a new family of resampling-based penalization procedures for model selection is defined in a general framework. It generalizes several methods, including Efron's bootstrap penalization and the leave-one-out penalization recently proposed by Arlot (2008), to any exchangeable weighted bootstrap resampling scheme. In the heteroscedastic regression framework, assuming the models to have a particular structure, these resampling penalties are proved to satisfy a non-asymptotic oracle inequality with leading constant close to 1. In particular, they are asymptotically optimal. Resampling penalties are used for defining an estimator adapting simultaneously to the smoothness of the regression function and to the heteroscedasticity of the noise. This is remarkable because resampling penalties are general-purpose devices, which have not been built specifically to handle heteroscedastic data. Hence, resampling penalties naturally adapt to heteroscedasticity. A simulation study shows that resampling penalties improve on V-fold cross-validation in terms of final prediction error, in particular when the signal-to-noise ratio is not large.
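
The idea summarized in the abstract can be illustrated with a minimal Python sketch. It is not code from the paper, and it rests on several assumptions made only for illustration: the models are regressograms (piecewise-constant fits) on regular partitions of [0, 1], the resampling scheme is Efron's bootstrap with multinomial weights, the expectation over the weights is approximated by Monte Carlo, the multiplicative constant `const` is set to 1, and bins left empty by a resample are simply ignored. The function names (`regressogram`, `resampling_penalty`, `select_model`) are hypothetical.

```python
import numpy as np

def regressogram(x, y, w, n_bins):
    """Weighted least-squares histogram fit on a regular partition of [0, 1].
    Returns predictions at the sample points (NaN where the weighted bin is empty)."""
    bins = np.minimum((x * n_bins).astype(int), n_bins - 1)
    w_sum = np.bincount(bins, weights=w, minlength=n_bins)
    wy_sum = np.bincount(bins, weights=w * y, minlength=n_bins)
    values = np.full(n_bins, np.nan)
    ok = w_sum > 0
    values[ok] = wy_sum[ok] / w_sum[ok]
    return values[bins]

def resampling_penalty(x, y, n_bins, n_resamples=100, const=1.0, rng=None):
    """Monte Carlo estimate of const * E_W[P_n g(s_W) - P_n^W g(s_W)], where s_W is
    the fit on the reweighted sample and g is the squared loss.
    Assumption: Efron bootstrap weights; points in emptied bins are skipped."""
    rng = np.random.default_rng(rng)
    n = len(y)
    gaps = np.empty(n_resamples)
    for b in range(n_resamples):
        w = rng.multinomial(n, np.full(n, 1.0 / n)).astype(float)
        pred = regressogram(x, y, w, n_bins)
        sq = np.where(np.isnan(pred), 0.0, (y - pred) ** 2)
        # Risk under the original empirical measure minus risk under the resampled one.
        gaps[b] = np.mean(sq) - np.mean(w * sq)
    return const * gaps.mean()

def select_model(x, y, candidates, **kwargs):
    """Pick the partition size minimizing empirical risk + resampling penalty."""
    ones = np.ones(len(y))
    crit = {}
    for d in candidates:
        emp_risk = np.mean((y - regressogram(x, y, ones, d)) ** 2)
        crit[d] = emp_risk + resampling_penalty(x, y, d, **kwargs)
    return min(crit, key=crit.get), crit

# Usage on synthetic heteroscedastic data (noise level grows with x).
rng = np.random.default_rng(0)
x = rng.uniform(size=200)
y = np.sin(2 * np.pi * x) + (0.2 + x) * rng.normal(size=200)
best, crit = select_model(x, y, candidates=range(1, 21), rng=1)
```

The sketch never estimates the noise level, which is in the spirit of the general-purpose nature of resampling penalties noted in the abstract. The value of the constant and the handling of empty bins here are simplifications, not the paper's prescriptions.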

Keywords

non-parametric statistics; resampling; exchangeable weighted bootstrap; model selection; penalization; non-parametric regression; adaptivity; heteroscedastic data; regressogram; histogram selection

Domains

Statistics [math.ST]; Statistics Theory [stat.TH]

Main file

RP.pdf (884.99 KB)

RP_appendix.pdf (350.77 KB)

Origin : Files produced by the author(s)

Format : Other

https://hal.science/hal-00262478

Submitted on : Wednesday, June 17, 2009-11:28:29 AM

Last modification on : Friday, April 19, 2024-4:18:55 PM

Long-term archiving on: Wednesday, September 22, 2010-12:36:54 PM

Dates and versions

hal-00262478 , version 1 (11-03-2008)

hal-00262478 , version 2 (17-06-2009)

Identifiers

HAL Id : hal-00262478 , version 2
ARXIV : 0906.3124
DOI : 10.1214/08-EJS196

Cite

Sylvain Arlot. Model selection by resampling penalization. Electronic Journal of Statistics, 2009, 3, pp. 557-624. ⟨10.1214/08-EJS196⟩. ⟨hal-00262478v2⟩

Collections

ENS-PARIS CNRS INRIA INRIA2 PSL
