The first direct acceleration of stochastic gradient methods, Proceedings of Symposium on Theory of Computing, pp.1200-1205, 2017. ,
Synchronization and Linearity: an Algebra for Discrete Event Systems, 1992. ,
Large-scale machine learning with stochastic gradient descent, Proceedings of COMPSTAT, pp.177-186, 2010. ,
Randomized gossip algorithms, IEEE Transactions on Information Theory, vol.52, issue.6, pp.2508-2530, 2006. ,
, Convex optimization: Algorithms and complexity. Foundations and Trends R in Machine Learning, vol.8, pp.231-357, 2015.
, , 2016.
Gossip dual averaging for decentralized optimization of pairwise functions, Proceedings of the International Conference on International Conference on Machine Learning, vol.48, pp.1388-1396, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01329315
A simple practical accelerated method for finite sums, Advances in Neural Information Processing Systems, pp.676-684, 2016. ,
SAGA: A fast incremental gradient method with support for non-strongly convex composite objectives, Advances in Neural Information Processing Systems, pp.1646-1654, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01016843
Dual averaging for distributed optimization: Convergence analysis and network scaling, IEEE Transactions on Automatic Control, vol.57, issue.3, pp.592-606, 2012. ,
Accelerated, parallel, and proximal coordinate descent, SIAM Journal on Optimization, vol.25, issue.4, pp.1997-2023, 2015. ,
Cola: Decentralized linear learning, Advances in Neural Information Processing Systems, pp.4536-4546, 2018. ,