Choosing a penalty for model selection in heteroscedastic regression

Sylvain Arlot

Preprints, Working Papers, ... Year : 2010

Choosing a penalty for model selection in heteroscedastic regression

(1, 2)

1
2

Sylvain Arlot

Function : Author
PersonId : 1608
IdHAL : sylvain-arlot
IdRef : 124609589

Laboratoire d'informatique de l'école normale supérieure

Models of visual object recognition and scene understanding

Abstract

We consider the problem of choosing between several models in least-squares regression with heteroscedastic data. We prove that any penalization procedure is suboptimal when the penalty is a function of the dimension of the model, at least for some typical heteroscedastic model selection problems. In particular, Mallows' Cp is suboptimal in this framework. On the contrary, optimal model selection is possible with data-driven penalties such as resampling or $V$-fold penalties. Therefore, it is worth estimating the shape of the penalty from data, even at the price of a higher computational cost. Simulation experiments illustrate the existence of a trade-off between statistical accuracy and computational complexity. As a conclusion, we sketch some rules for choosing a penalty in least-squares regression, depending on what is known about possible variations of the noise-level.

Keywords

non-parametric regression model selection penalization heteroscedastic data Mallows Cp resampling penalties

Domains

Statistics [math.ST] Statistics Theory [stat.TH]

Fichier principal

shape.pdf (461.72 Ko)

Origin : Files produced by the author(s)

Sylvain Arlot : Connect in order to contact the contributor

https://hal.science/hal-00347811

Submitted on : Thursday, June 3, 2010-7:24:45 PM

Last modification on : Friday, April 19, 2024-4:18:55 PM

Long-term archiving on: Thursday, September 23, 2010-12:56:52 PM

Dates and versions

hal-00347811 , version 1 (16-12-2008)

hal-00347811 , version 2 (03-06-2010)

Identifiers

HAL Id : hal-00347811 , version 2
ARXIV : 0812.3141

Cite

Sylvain Arlot. Choosing a penalty for model selection in heteroscedastic regression. 2010. ⟨hal-00347811v2⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENS-PARIS CNRS INRIA INRIA2 PSL ANR

856 View

276 Download

Choosing a penalty for model selection in heteroscedastic regression

Abstract

Keywords

Domains

Dates and versions

Identifiers

Cite

Export

Collections

Altmetric

Share