From Averaging to Acceleration, There is Only a Step-size - HAL open archive
Conference paper, Year: 2015

From Averaging to Acceleration, There is Only a Step-size

Abstract

We show that accelerated gradient descent, averaged gradient descent and the heavy-ball method for non-strongly-convex problems may be reformulated as constant-parameter second-order difference equation algorithms, where stability of the system is equivalent to convergence at rate O(1/n^2), where n is the number of iterations. We provide a detailed analysis of the eigenvalues of the corresponding linear dynamical system, showing various oscillatory and non-oscillatory behaviors, together with a sharp stability result with explicit constants. We also consider the situation where noisy gradients are available, where we extend our general convergence result, which suggests an alternative algorithm (i.e., with different step sizes) that exhibits the good aspects of both averaging and acceleration.
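As an illustration of what a second-order difference equation algorithm looks like, the following is a minimal sketch on a toy diagonal quadratic. It is not the paper's parametrization: the function names, the test problem, and the momentum schedule (k-2)/(k+1) (a standard Nesterov-style choice) are all assumptions made for this example. Setting the momentum to zero recovers plain gradient descent.

```python
# Toy sketch (assumptions throughout, not the paper's exact recursion):
# minimize f(theta) = 0.5 * sum_i h_i * theta_i^2 with a two-term recursion
#   y       = theta_{k-1} + m_k * (theta_{k-1} - theta_{k-2})
#   theta_k = y - step * grad f(y)

def quad_value(hs, theta):
    """Value of the diagonal quadratic 0.5 * sum(h_i * theta_i^2)."""
    return 0.5 * sum(h * t * t for h, t in zip(hs, theta))

def run_recursion(hs, theta0, n_iters, step, momentum):
    """Run the second-order recursion; momentum(k) gives the coefficient m_k."""
    prev = list(theta0)
    cur = list(theta0)
    values = [quad_value(hs, cur)]
    for k in range(1, n_iters + 1):
        m = momentum(k)
        # Extrapolated point built from the last two iterates.
        y = [c + m * (c - p) for c, p in zip(cur, prev)]
        # The gradient of the quadratic at y is h_i * y_i, coordinate-wise.
        new = [yi - step * h * yi for h, yi in zip(hs, y)]
        prev, cur = cur, new
        values.append(quad_value(hs, cur))
    return cur, values

# Hypothetical test problem: eigenvalues spread over several orders of magnitude.
hs = [1.0, 0.1, 0.01, 0.001]
theta0 = [1.0, 1.0, 1.0, 1.0]
n_iters = 100
lipschitz = max(hs)                      # largest eigenvalue of the Hessian
radius_sq = sum(t * t for t in theta0)   # squared distance to the minimizer 0
step = 1.0 / lipschitz

# Zero momentum: plain gradient descent.
_, gd_values = run_recursion(hs, theta0, n_iters, step, lambda k: 0.0)
# Nesterov-style momentum schedule (an assumed, standard choice).
_, acc_values = run_recursion(
    hs, theta0, n_iters, step, lambda k: max(k - 2, 0) / (k + 1)
)

print("gradient descent  f(theta_n):", gd_values[-1])
print("accelerated       f(theta_n):", acc_values[-1])
```

On this toy problem the accelerated run should end with a smaller function value than plain gradient descent, in line with the classical O(1/n^2) versus O(1/n) guarantees for the two schemes.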
Main file
faatos_hal.pdf (471.78 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01136945, version 1 (30-03-2015)

Cite

Nicolas Flammarion, Francis Bach. From Averaging to Acceleration, There is Only a Step-size. Proceedings of The 28th Conference on Learning Theory (COLT), 2015, Paris, France. ⟨hal-01136945⟩
