Non-Asymptotic Analysis of Stochastic Approximation Algorithms for Machine Learning

Francis Bach; Eric Moulines

Conference Papers. Year : 2011



Francis Bach

Function : Author
PersonId : 841662

Laboratoire d'informatique de l'école normale supérieure

Statistical Machine Learning and Parsimony

Eric Moulines

Function : Author
PersonId : 1350242
ORCID : 0000-0002-2058-0693
IdRef : 076452476

Laboratoire Traitement et Communication de l'Information

Abstract

In this paper, we consider the minimization of a convex objective function defined on a Hilbert space, which is only available through unbiased estimates of its gradients. This problem includes standard machine learning algorithms such as kernel logistic regression and least-squares regression, and is commonly referred to as a stochastic approximation problem in the operations research community. We provide a non-asymptotic analysis of the convergence of two well-known algorithms, stochastic gradient descent (a.k.a. Robbins-Monro algorithm) as well as a simple modification where iterates are averaged (a.k.a. Polyak-Ruppert averaging). Our analysis suggests that a learning rate proportional to the inverse of the number of iterations, while leading to the optimal convergence rate in the strongly convex case, is not robust to the lack of strong convexity or the setting of the proportionality constant. This situation is remedied when using slower decays together with averaging, robustly leading to the optimal rate of convergence. We illustrate our theoretical results with simulations on synthetic and standard datasets.
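As a rough illustration of the two algorithms named in the abstract, the sketch below runs stochastic gradient descent on a synthetic least-squares problem and keeps a running Polyak-Ruppert average of the iterates. It is not the authors' code. The step-size form `C * n ** (-alpha)`, the Gaussian synthetic data, and all names (`sgd_least_squares`, `l2_error`, `C`, `alpha`, `noise`) are assumptions made for this example; `alpha = 1` would correspond to a learning rate proportional to the inverse of the number of iterations, and `alpha < 1` to a slower decay.

```python
import math
import random


def l2_error(a, b):
    """Euclidean distance between two vectors given as lists."""
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))


def sgd_least_squares(n_iters, dim=5, C=0.1, alpha=0.5, noise=0.1, seed=0):
    """SGD with iterate averaging on synthetic least-squares data.

    Assumed setup (not from the source): inputs x are standard Gaussian,
    outputs are y = <x, theta_star> + noise, and the step size at
    iteration n is C * n ** (-alpha).
    Returns (last_iterate, averaged_iterate, theta_star).
    """
    rng = random.Random(seed)
    theta_star = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    theta = [0.0] * dim       # current iterate
    theta_bar = [0.0] * dim   # running average of the iterates

    for n in range(1, n_iters + 1):
        # Draw one fresh observation (x, y).
        x = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        y = sum(xi * ti for xi, ti in zip(x, theta_star))
        y += noise * rng.gauss(0.0, 1.0)

        # Unbiased gradient estimate of 0.5 * E[(<x, theta> - y)^2].
        residual = sum(xi * ti for xi, ti in zip(x, theta)) - y
        gamma = C * n ** (-alpha)
        theta = [ti - gamma * residual * xi for ti, xi in zip(theta, x)]

        # Incremental update of the average of theta_1, ..., theta_n.
        theta_bar = [b + (t - b) / n for b, t in zip(theta_bar, theta)]

    return theta, theta_bar, theta_star


if __name__ == "__main__":
    last, averaged, target = sgd_least_squares(5000)
    print("error of last iterate:    ", l2_error(last, target))
    print("error of averaged iterate:", l2_error(averaged, target))
```

Changing `alpha` and `C` in this sketch is one way to explore, informally, the sensitivity to the decay rate and proportionality constant that the abstract discusses; the toy problem does not reproduce the paper's experiments.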

Domains

Machine Learning [cs.LG]; Optimization and Control [math.OC]; Machine Learning [stat.ML]; Signal and Image processing; Signal and Image Processing

Main file

gradsto_hal.pdf (642.5 KB)

Origin : Files produced by the author(s)


https://hal.science/hal-00608041

Submitted on : Tuesday, July 12, 2011-9:17:30 AM

Last modification on : Friday, April 19, 2024-4:18:55 PM

Long-term archiving on: Thursday, October 13, 2011-2:21:19 AM

Dates and versions

hal-00608041 , version 1 (12-07-2011)

Identifiers

HAL Id : hal-00608041 , version 1

Cite

Francis Bach, Eric Moulines. Non-Asymptotic Analysis of Stochastic Approximation Algorithms for Machine Learning. Neural Information Processing Systems (NIPS), 2011, Spain. ⟨hal-00608041⟩


Collections

INSTITUT-TELECOM ENS-PARIS CNRS INRIA PARISTECH INRIA2 TDS-MACS PSL LTCI

