Harder, Better, Faster, Stronger Convergence Rates for Least-Squares Regression

Aymeric Dieuleveut; Nicolas Flammarion; Francis Bach

Article Dans Une Revue Journal of Machine Learning Research Année : 2017

Harder, Better, Faster, Stronger Convergence Rates for Least-Squares Regression

(1, 2) , (1, 2) , (1, 2)

1
2

Aymeric Dieuleveut

Fonction : Auteur correspondant
PersonId : 1109167
IdHAL : aymeric-dieuleveut
ORCID : 0009-0005-1848-1724

Connectez-vous pour contacter l'auteur

Laboratoire d'informatique de l'école normale supérieure

Statistical Machine Learning and Parsimony

Nicolas Flammarion

Fonction : Auteur

Laboratoire d'informatique de l'école normale supérieure

Statistical Machine Learning and Parsimony

Francis Bach

Fonction : Auteur

Laboratoire d'informatique de l'école normale supérieure

Statistical Machine Learning and Parsimony

Résumé

We consider the optimization of a quadratic objective function whose gradients are only accessible through a stochastic oracle that returns the gradient at any given point plus a zero-mean finite variance random error. We present the first algorithm that achieves jointly the optimal prediction error rates for least-squares regression, both in terms of forgetting of initial conditions in O(1/n 2), and in terms of dependence on the noise and dimension d of the problem, as O(d/n). Our new algorithm is based on averaged accelerated regularized gradient descent, and may also be analyzed through finer assumptions on initial conditions and the Hessian matrix, leading to dimension-free quantities that may still be small while the " optimal " terms above are large. In order to characterize the tightness of these new bounds, we consider an application to non-parametric regression and use the known lower bounds on the statistical performance (without computational limits), which happen to match our bounds obtained from a single pass on the data and thus show optimality of our algorithm in a wide variety of particular trade-offs between bias and variance.

Mots clés

Stochastic gradient Least-squares regression Accelerated gra- dient Non-parametric estimation. Convex optimization

Domaines

Optimisation et contrôle [math.OC] Apprentissage [cs.LG] Autres [stat.ML]

Fichier principal

tighter_hal.pdf (448.96 Ko)

bias-eps-converted-to.pdf (12.85 Ko)

variance-eps-converted-to.pdf (12.9 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Nicolas Flammarion : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01275431

Soumis le : mardi 23 février 2016-21:56:47

Dernière modification le : vendredi 19 avril 2024-16:18:55

Archivage à long terme le : mardi 24 mai 2016-11:07:01

Dates et versions

hal-01275431 , version 1 (17-02-2016)

hal-01275431 , version 2 (23-02-2016)

Identifiants

HAL Id : hal-01275431 , version 2
ARXIV : 1602.05419

Citer

Aymeric Dieuleveut, Nicolas Flammarion, Francis Bach. Harder, Better, Faster, Stronger Convergence Rates for Least-Squares Regression. Journal of Machine Learning Research, 2017, 17 (101), pp.1-51. ⟨hal-01275431v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENS-PARIS CNRS INRIA INRIA2 TDS-MACS PSL

434 Consultations

1709 Téléchargements

Harder, Better, Faster, Stronger Convergence Rates for Least-Squares Regression

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager