Quantitative bounds for concentration-of-measure inequalities and empirical regression: the independent case - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Journal of Complexity Année : 2019

Quantitative bounds for concentration-of-measure inequalities and empirical regression: the independent case

Résumé

This paper is devoted to the study of the deviation of the (random) average $L^{2}-$error associated to the least--squares regressor over a family of functions ${\cal F}_{n}$ (with controlled complexity) obtained from $n$ independent, but not necessarily identically distributed, samples of explanatory and response variables, from the minimal (deterministic) average $L^{2}-$error associated to this family of functions, and to some of the corresponding consequences for the problem of consistency. In the i.i.d. case, this specializes as classical questions on least--squares regression problems, but in more general cases, this setting permits a precise investigation in the direction of the study of nonasymptotic errors for least--squares regression schemes in nonstationary settings, which we motivate providing background and examples. More precisely, we prove first two nonasymptotic deviation inequalities that generalize and refine corresponding known results in the i.i.d. case. We then explore some consequences for nonasymptotic bounds of the error both in the weak and the strong senses. Finally, we exploit these estimates to shed new light into questions of consistency for least--squares regression schemes in the distribution--free, nonparametric setting. As an application to the classical theory, we provide in particular a result that generalizes the link between the problem of consistency and the Glivenko-Cantelli property, which applied to regression in the i.i.d. setting over non--decreasing families $({\cal F}_{n})_{n}$ of functions permits to create a scheme which is strongly consistent in $L^{2}$ under the sole (necessary) assumption of the existence of functions in $\cup_{n}{\cal F}_{n}$ which are arbitrarily close in $L^{2}$ to the corresponding regressor.
Fichier principal
Vignette du fichier
FinalVersionRevision_JoC_for_publication.pdf (725.98 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01832195 , version 1 (06-07-2018)
hal-01832195 , version 2 (26-01-2019)

Identifiants

Citer

David Barrera, Emmanuel Gobet. Quantitative bounds for concentration-of-measure inequalities and empirical regression: the independent case. Journal of Complexity, 2019, 52, pp.45-81. ⟨10.1016/j.jco.2019.01.003⟩. ⟨hal-01832195v2⟩
282 Consultations
341 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More