High Order Numerical Approximation of the Invariant Measure of Ergodic SDEs, SIAM Journal on Numerical Analysis, vol.52, issue.4, pp.1600-1622, 2014. ,
DOI : 10.1137/130935616
URL : https://hal.archives-ouvertes.fr/hal-00858088
Adaptivity of averaged stochastic gradient descent to local strong convexity for logistic regression, J. Mach. Learn. Res, vol.15, issue.1, pp.595-627, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-00804431
Non-strongly-convex smooth stochastic approximation with convergence rate O(1/n) Advances, in Neural Information Processing Systems (NIPS), 2013. ,
Nonlinear programming, Athena Scientific, 1995. ,
The tradeoffs of large scale learning, Advances in Neural Information Processing Systems (NIPS), 2008. ,
On the convergence of stochastic gradient MCMC algorithms with high-order integrators, NIPS, pp.2269-2277, 2015. ,
Averaged least-mean-squares: bias-variance trade-offs and optimal sampling distributions, Proceedings of the International Conference on Artificial Intelligence and Statistics, p.2015 ,
Nonparametric stochastic approximation with large step-sizes, The Annals of Statistics, vol.44, issue.4, pp.1363-1399 ,
DOI : 10.1214/15-AOS1391
URL : https://hal.archives-ouvertes.fr/hal-01053831
Harder, Better, Faster, Stronger Convergence Rates for Least-Squares Regression. ArXiv e-prints, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01275431
Stochastic Gradient Richardson- Romberg Markov Chain Monte Carlo, Advances in Neural Information Processing Systems, pp.2047-2055, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01354064
Ordinary Differential Equations: Second Edition, Classics in Applied Mathematics. Society for Industrial and Applied Mathematics, 1982. ,
Parallelizing Stochastic Approximation Through Mini-Batching and Tail-Averaging. ArXiv e-prints, 2016. ,
Accelerating Stochastic Gradient Descent, 2017. ,
On the Markov chain central limit theorem, Probability Surveys, vol.1, issue.0, pp.299-320, 2004. ,
DOI : 10.1214/154957804100000051
URL : http://arxiv.org/abs/math/0409112
Stochastic Approximation and Recursive Algorithms and Applications, 2003. ,
An optimal method for stochastic composite optimization, Mathematical Programming, vol.24, issue.1-2, pp.365-397, 2012. ,
DOI : 10.1023/A:1021814225969
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.416.1110
Why do partitions occur in Faa di Bruno's chain rule for higher derivatives?, 2006. ,
Stochastic approximation and optimization of random systems. DMV Seminar, 1992. ,
DOI : 10.1007/978-3-0348-8609-3
Ergodicity for SDEs and approximations: locally Lipschitz vector fields and degenerate noise, Stochastic Processes and their Applications, vol.101, issue.2, pp.185-232, 2002. ,
DOI : 10.1016/S0304-4149(02)00150-3
URL : http://doi.org/10.1016/s0304-4149(02)00150-3
Markov Chains and Stochastic Stability, 2009. ,
Markov chains and stochastic stability, 1993. ,
Convergence rate of incremental subgradient algorithms, Stochastic optimization: algorithms and applications, pp.223-264, 2001. ,
Stochastic gradient descent, weighted sampling, and the randomized Kaczmarz algorithm, Advances in Neural Information Processing Systems 27, pp.1017-1025, 2014. ,
DOI : 10.1137/120889897
URL : http://arxiv.org/abs/1310.5715
Robust Stochastic Approximation Approach to Stochastic Programming, SIAM Journal on Optimization, vol.19, issue.4, pp.1574-1609, 2009. ,
DOI : 10.1137/070704277
URL : https://hal.archives-ouvertes.fr/hal-00976649
Introductory Lectures on Convex Optimization: A Basic Course. Applied Optimization, 2004. ,
DOI : 10.1007/978-1-4419-8853-9
Confidence level solutions for stochastic programming, Automatica, vol.44, issue.6, pp.1559-1568, 2008. ,
DOI : 10.1016/j.automatica.2008.01.017
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.34.5840
Stochastic Minimization with Constant Step-Size: Asymptotic Laws, SIAM Journal on Control and Optimization, vol.24, issue.4, pp.655-666, 1986. ,
DOI : 10.1137/0324039
Acceleration of Stochastic Approximation by Averaging, SIAM Journal on Control and Optimization, vol.30, issue.4, pp.838-855, 1992. ,
DOI : 10.1137/0330046
Making Gradient Descent Optimal for Strongly Convex Stochastic Optimization. ArXiv e-prints, 2011. ,
A stochastic approxiation method. The Annals of mathematical, Statistics, vol.22, issue.3, pp.400-407, 1951. ,
Efficient estimations from a slowly convergent Robbins-Monro process, 1988. ,
Pegasos, Proceedings of the 24th international conference on Machine learning, ICML '07, pp.807-814, 2007. ,
DOI : 10.1145/1273496.1273598
Stochastic convex optimization, Proceedings of the International Conference on Learning Theory (COLT), 2009. ,
Stochastic Gradient Descent for Non-smooth Optimization: Convergence Results and Optimal Averaging Schemes, Proceedings of the 30 t h International Conference on Machine Learning, 2013. differential equations. Stochastic Anal, pp.483-509, 1990. ,
Optimal transport : old and new. Grundlehren der mathematischen Wissenschaften, 2009. ,
DOI : 10.1007/978-3-540-71050-9
Bayesian learning via Stochastic Gradient Langevin Dynamics, ICML, pp.681-688, 2011. ,
Co-Coercivity and Its Role in the Convergence of Iterative Schemes for Solving Variational Inequalities, SIAM Journal on Optimization, vol.6, issue.3, pp.714-726, 1996. ,
DOI : 10.1137/S1052623494250415