Sparsification as a Remedy for Staleness in Distributed Asynchronous SGD

Rosa Candela; Giulio Franzese; Maurizio Filippone; Pietro Michiardi

Pré-Publication, Document De Travail Année : 2019

Sparsification as a Remedy for Staleness in Distributed Asynchronous SGD

(1) , (1) , (1) , (1)

Rosa Candela

Fonction : Auteur

Eurecom [Sophia Antipolis]

Giulio Franzese

Fonction : Auteur

Eurecom [Sophia Antipolis]

Maurizio Filippone

Fonction : Auteur
PersonId : 1021042

Eurecom [Sophia Antipolis]

Pietro Michiardi

Fonction : Auteur
PersonId : 1084771

Eurecom [Sophia Antipolis]

Résumé

Large scale machine learning is increasingly relying on distributed optimization, whereby several machines contribute to the training process of a statistical model. In this work we study the performance of asynchronous, distributed settings, when applying sparsification, a technique used to reduce communication overheads. In particular, for the first time in an asynchronous, non-convex setting, we theoretically prove that, in presence of staleness, sparsification does not harm SGD performance: the ergodic convergence rate matches the known result of standard SGD, that is $\mathcal{O} \left( 1/\sqrt{T} \right)$. We also carry out an empirical study to complement our theory, and confirm that the effects of sparsification on the convergence rate are negligible, when compared to 'vanilla' SGD, even in the challenging scenario of an asynchronous, distributed system.

Mots clés

Stochastic optimization Asynchronous Sparsification

Domaines

Ingénierie assistée par ordinateur

Centre De Documentation Eurecom : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03345253

Soumis le : mercredi 15 septembre 2021-13:48:48

Dernière modification le : jeudi 16 septembre 2021-03:40:44

Dates et versions

hal-03345253 , version 1 (15-09-2021)

Identifiants

HAL Id : hal-03345253 , version 1
ARXIV : 1910.09466

Citer

Rosa Candela, Giulio Franzese, Maurizio Filippone, Pietro Michiardi. Sparsification as a Remedy for Staleness in Distributed Asynchronous SGD. 2019. ⟨hal-03345253⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

EURECOM 3IA-COTEDAZUR ANR

19 Consultations

0 Téléchargements

Sparsification as a Remedy for Staleness in Distributed Asynchronous SGD

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager