S. Amari, Natural Gradient Works Efficiently in Learning, Neural Computation, vol.10, issue.2, pp.251-276, 1998.
DOI : 10.1162/089976698300017746

A. Anandkumar, V. Tan, F. Huang, and A. Willsky, High-dimensional Gaussian graphical model selection: Walk summability and local separation criterion, JMLR, pp.2293-2337, 2012.

L. Arnold, A. Auger, N. Hansen, and Y. Ollivier, Information-Geometric Optimization Algorithms: A Unifying Picture via Invariance Principles, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00601503

F. Chang, Inversion of a perturbed matrix, Applied Mathematics Letters, vol.19, issue.2, pp.169-173, 2006.
DOI : 10.1016/j.aml.2005.04.004

C. Hsieh, M. A. Sustik, I. S. Dhillon, and P. Ravikumar, Sparse inverse covariance matrix estimation using quadratic approximation, Advances in Neural Information Processing Systems, 2011.

C. Chow and C. Liu, Approximating discrete probability distributions with dependence trees, IEEE Transactions on Information Theory, vol.14, issue.3, pp.462-467, 1968.
DOI : 10.1109/tit.1968.1054142
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.133.9772

S. Cocco and R. Monasson, Adaptive Cluster Expansion for the Inverse Ising Problem: Convergence, Algorithm and Tests, Journal of Statistical Physics, vol.147, issue.2, pp.252-314, 2012.
DOI : 10.1007/s10955-012-0463-4
URL : https://hal.archives-ouvertes.fr/hal-00634921

S. Cocco, R. Monasson, and V. Sessak, High-dimensional inference with the generalized Hopfield model: Principal component analysis and corrections, Physical Review E, vol.83, issue.5, p.051123, 2011.
DOI : 10.1103/PhysRevE.83.051123
URL : https://hal.archives-ouvertes.fr/hal-00586950

A. de Palma and F. Marchal, Real cases applications of the fully dynamic METROPOLIS tool-box: an advocacy for large-scale mesoscopic transportation systems, Networks and Spatial Economics, vol.2, issue.4, pp.347-369, 2002.

B. Dong and Y. Zhang, An Efficient Algorithm for ℓ0 Minimization in Wavelet Frame Based Image Restoration, Journal of Scientific Computing, vol.54, issue.2-3, pp.350-368, 2012.
DOI : 10.1007/s10915-012-9597-4

J. Eckstein, Nonlinear Proximal Point Algorithms Using Bregman Functions, with Applications to Convex Programming, Mathematics of Operations Research, vol.18, issue.1, pp.202-226, 1993.
DOI : 10.1287/moor.18.1.202

J. Fan and R. Li, Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties, Journal of the American Statistical Association, vol.96, issue.456, pp.1348-1360, 2001.
DOI : 10.1198/016214501753382273
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.128.4174

J. Friedman, T. Hastie, and R. Tibshirani, Sparse inverse covariance estimation with the graphical lasso, Biostatistics, vol.9, issue.3, pp.432-441, 2008.
DOI : 10.1093/biostatistics/kxm045

C. Furtlehner, Y. Han, J. Lasgouttes, V. Martin, F. Marchal et al., Spatial and temporal analysis of traffic states on large scale networks, 13th International IEEE Conference on Intelligent Transportation Systems, pp.1215-1220, 2010.
DOI : 10.1109/ITSC.2010.5625175
URL : https://hal.archives-ouvertes.fr/hal-00527481

C. Furtlehner, J. Lasgouttes, and A. Auger, Learning multiple belief propagation fixed points for real time inference, Physica A: Statistical Mechanics and its Applications, vol.389, issue.1, pp.149-163, 2010.
DOI : 10.1016/j.physa.2009.08.030
URL : https://hal.archives-ouvertes.fr/inria-00371372

C. Furtlehner, J. Lasgouttes, and A. De-la-fortelle, A Belief Propagation Approach to Traffic Prediction using Probe Vehicles, 2007 IEEE Intelligent Transportation Systems Conference, pp.1022-1027, 2007.
DOI : 10.1109/ITSC.2007.4357716
URL : https://hal.archives-ouvertes.fr/hal-00175627

A. Georges and J. Yedidia, How to expand around mean-field theory using high-temperature expansions, Journal of Physics A: Mathematical and General, vol.24, issue.9, p.2173, 1991.
DOI : 10.1088/0305-4470/24/9/024

H. Höfling and R. Tibshirani, Estimation of sparse binary pairwise Markov networks using pseudo-likelihoods, Journal of Machine Learning Research, vol.10, pp.883-906, 2009.

J. J. Hopfield, Neural networks and physical systems with emergent collective computational abilities, Proc. of Natl. Acad. Sci. USA, vol.79, issue.8, pp.2554-2558, 1982.

C. Hsieh, M. Sustik, I. Dhillon, and P. Ravikumar, Sparse inverse covariance matrix estimation using quadratic approximation, Advances in Neural Information Processing Systems 24, pp.2330-2338, 2011.

S. Lee, V. Ganapathi, and D. Koller, Efficient structure learning of Markov networks using L1-regularization, NIPS, 2006.

A. Iusem, Augmented Lagrangian methods and proximal point methods for convex optimization, Investigacion Operativa, pp.11-50, 1999.

H. Kappen and F. Rodríguez, Efficient Learning in Boltzmann Machines Using Linear Response Theory, Neural Computation, vol.10, issue.5, pp.1137-1156, 1998.

L. Dicker, B. Huang, and X. Lin, Variable selection and estimation with the seamless-L0 penalty, Statistica Sinica, in press, 2012.

D. Malioutov, J. Johnson, and A. Willsky, Walk-sums and belief propagation in Gaussian graphical models, The Journal of Machine Learning Research, vol.7, pp.2031-2064, 2006.

V. Martin, J. Lasgouttes, and C. Furtlehner, Encoding dependencies between real-valued observables with a binary latent MRF, 2012.

M. Mézard and T. Mora, Constraint satisfaction problems and neural networks: A statistical physics perspective, Journal of Physiology-Paris, vol.103, issue.1-2, pp.107-113, 2009.
DOI : 10.1016/j.jphysparis.2009.05.013

T. Mora, Géométrie et inférence dans l'optimisation et en théorie de l'information, PhD thesis, 2007.

P. Netrapalli, S. Banerjee, S. Sanghavi, and S. Shakkottai, Greedy learning of Markov network structure, 2010 48th Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp.1295-1302, 2010.
DOI : 10.1109/ALLERTON.2010.5707063

H. Nguyen and J. Berg, Bethe-Peierls approximation and the inverse Ising model, J. Stat. Mech., p.P03004, 2012. arXiv:1112.3501

H. Nguyen and J. Berg, Mean-Field Theory for the Inverse Ising Problem at Low Temperatures, Physical Review Letters, vol.109, issue.5, p.050602, 2012.
DOI : 10.1103/PhysRevLett.109.050602

T. Plefka, Convergence condition of the TAP equation for the infinite-ranged Ising spin glass model, Journal of Physics A: Mathematical and General, vol.15, issue.6, pp.1971-1978, 1982.
DOI : 10.1088/0305-4470/15/6/035

R. Tibshirani, Regression shrinkage and selection via the lasso, Journal of the Royal Statistical Society, Series B, vol.58, pp.267-288, 1996.

M. Welling and Y. Teh, Approximate inference in Boltzmann machines, Artificial Intelligence, vol.143, issue.1, pp.19-50, 2003.
DOI : 10.1016/S0004-3702(02)00361-2

M. Yasuda and K. Tanaka, Approximate Learning Algorithm in Boltzmann Machines, Neural Computation, vol.21, issue.11, pp.3130-3178, 2009.

J. S. Yedidia, W. T. Freeman, and Y. Weiss, Generalized belief propagation, Advances in Neural Information Processing Systems, pp.689-695, 2001.