F. Bénézit, V. Blondel, P. Thiran, J. Tsitsiklis, and M. Vetterli, Weighted Gossip: Distributed Averaging using non-doubly stochastic matrices, 2010 IEEE International Symposium on Information Theory, 2010.
DOI : 10.1109/ISIT.2010.5513273

L. Bottou, Large-scale machine learning with stochastic gradient descent, Proceedings of COMPSTAT'2010 (International Conference on Computational Statistics), pp.177-186, 2010.
DOI : 10.1007/978-3-7908-2604-3_16
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.419.462

J. C. Duchi, A. Agarwal, and M. J. Wainwright, Dual averaging for distributed optimization, 2012 50th Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp.592-606, 2012.
DOI : 10.1109/Allerton.2012.6483406

C. Hensel and H. Dutta, GADGET SVM: a gossip-based sub-gradient SVM solver, International Conference on Machine Learning (ICML), Numerical Mathematics in Machine Learning Workshop, 2009.

Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, Gradient-based learning applied to document recognition, Proceedings of the IEEE, pp.2278-2324, 1998.
DOI : 10.1109/5.726791

Y. Nesterov, Primal-dual subgradient methods for convex problems, Mathematical Programming, vol.120, issue.1, pp.221-259, 2009.
DOI : 10.1007/s10107-007-0149-x

N. Le Roux, M. Schmidt, and F. R. Bach, A stochastic gradient method with an exponential convergence rate for finite training sets, NIPS, pp.2663-2671, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00674995

M. Schmidt, N. Le Roux, and F. Bach, Minimizing finite sums with the stochastic average gradient, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00860051

S. Shalev-Shwartz and T. Zhang, Stochastic dual coordinate ascent methods for regularized loss minimization, J. Mach. Learn. Res, vol.14, issue.1, pp.567-599, 2013.

P. Tseng, An Incremental Gradient(-Projection) Method with Momentum Term and Adaptive Stepsize Rule, SIAM Journal on Optimization, vol.8, issue.2, pp.506-531, 1998.
DOI : 10.1137/S1052623495294797

K. I. Tsianos, S. Lawlor, and M. G. Rabbat, Push-Sum Distributed Dual Averaging for convex optimization, 2012 IEEE 51st IEEE Conference on Decision and Control (CDC), pp.5453-5458, 2012.
DOI : 10.1109/CDC.2012.6426375

J. N. Tsitsiklis, Problems in decentralized decision making and computation, 1984.