Adaptivity of averaged stochastic gradient descent to local strong convexity for logistic regression - Archive ouverte HAL
Preprint / Working paper. Year: 2013

Adaptivity of averaged stochastic gradient descent to local strong convexity for logistic regression

Abstract

In this paper, we consider supervised learning problems such as logistic regression and study the stochastic gradient method with averaging, in the usual stochastic approximation setting where observations are used only once. We show that for self-concordant loss functions, after $n$ iterations, with a constant step-size proportional to $1/(R^2 \sqrt{n})$, where $n$ is the number of observations and $R$ is the maximum norm of the observations, the convergence rate is always of order $O(1/\sqrt{n})$, and improves to $O(R^2 / (\mu n))$, where $\mu$ is the lowest eigenvalue of the Hessian at the global optimum (when this eigenvalue is strictly positive). Since $\mu$ does not need to be known in advance, this shows that averaged stochastic gradient is adaptive to unknown local strong convexity of the objective function.
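The algorithm described in the abstract can be illustrated with a short sketch: single-pass averaged SGD on the logistic loss, with the constant step size $\gamma = 1/(R^2 \sqrt{n})$ set from the maximum observation norm $R$. The function name, data layout, and label convention ($y \in \{-1, +1\}$) below are illustrative assumptions, not from the paper itself.

```python
import numpy as np

def averaged_sgd_logistic(X, y):
    """Single-pass averaged SGD for logistic regression (illustrative sketch).

    X : (n, d) array of observations; y : (n,) array of labels in {-1, +1}.
    Returns the averaged iterate w_bar after one pass over the data.
    """
    n, d = X.shape
    # R is the maximum norm of the observations, as in the abstract.
    R = np.max(np.linalg.norm(X, axis=1))
    # Constant step size proportional to 1/(R^2 sqrt(n)).
    gamma = 1.0 / (R ** 2 * np.sqrt(n))
    w = np.zeros(d)       # current SGD iterate
    w_bar = np.zeros(d)   # running average of the iterates
    for t in range(n):
        x_t, y_t = X[t], y[t]  # each observation is used only once
        # Gradient of the logistic loss log(1 + exp(-y <w, x>)) at w.
        g = -y_t * x_t / (1.0 + np.exp(y_t * np.dot(w, x_t)))
        w -= gamma * g
        # Online update of the average: w_bar_t = (1/t) * sum of iterates.
        w_bar += (w - w_bar) / (t + 1)
    return w_bar
```

Returning the averaged iterate `w_bar` rather than the last iterate `w` is the key point: the abstract's rates, including the adaptive $O(R^2/(\mu n))$ rate under local strong convexity, are stated for the averaged iterate.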
Main file: twilight_final.pdf (247.38 KB). Origin: files produced by the author(s).

Dates and versions

hal-00804431 , version 1 (25-03-2013)
hal-00804431 , version 2 (26-10-2013)
hal-00804431 , version 3 (15-03-2014)

Identifiers

Cite

Francis Bach. Adaptivity of averaged stochastic gradient descent to local strong convexity for logistic regression. 2013. ⟨hal-00804431v1⟩