M. Abramowitz and I. Stegun, Handbook of Mathematical Functions, American Journal of Physics, vol.34, issue.2, 1964.
DOI : 10.1119/1.1972842

A. Alaoui and M. Mahoney, Fast randomized kernel ridge regression with statistical guarantees, Advances in Neural Information Processing Systems, pp.775-783, 2015.

N. Aronszajn, Theory of reproducing kernels, Transactions of the American Mathematical Society, vol.68, issue.3, pp.337-404, 1950.
DOI : 10.1090/S0002-9947-1950-0051437-7

W. Van Assche, Asymptotics for orthogonal polynomials, 1987.

F. Bach, Sharp analysis of low-rank kernel matrix approximations, Conference on Learning Theory, pp.185-209, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00723365

F. Bach, On the equivalence between kernel quadrature rules and random feature expansions, Journal of Machine Learning Research, vol.18, issue.21, pp.1-38, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01118276

L. Bos, Asymptotics for the Christoffel function for Jacobi like weights on a ball in R^m, New Zealand Journal of Mathematics, vol.23, issue.99, pp.109-116, 1994.

L. Bos, B. Della Vecchia, and G. Mastroianni, On the asymptotics of Christoffel functions for centrally symmetric weight functions on the ball in R^d, Rend. Circ. Mat. Palermo, vol.2, issue.52, pp.277-290, 1998.

S. Chatterjee and A. Hadi, Influential Observations, High Leverage Points, and Outliers in Linear Regression, Statistical Science, vol.1, issue.3, pp.379-393, 1986.
DOI : 10.1214/ss/1177013622
URL : https://doi.org/10.1214/ss/1177013622

K. Clarkson and D. Woodruff, Low rank approximation and regression in input sparsity time, ACM symposium on Theory of computing, pp.81-90, 2013.
DOI : 10.1145/3019134

P. Drineas, M. Magdon-Ismail, M. Mahoney, and D. Woodruff, Fast approximation of matrix coherence and statistical leverage, Journal of Machine Learning Research, vol.13, pp.3475-3506, 2012.

C. Dunkl and Y. Xu, Orthogonal polynomials of several variables, 2001.
DOI : 10.1017/cbo9780511565717
URL : http://arxiv.org/pdf/1701.02709

M. Hardy, Combinatorics of partial derivatives, The Electronic Journal of Combinatorics, 2006.

D. Hoaglin and R. Welsch, The hat matrix in regression and ANOVA, The American Statistician, pp.17-22, 1978.
DOI : 10.2307/2683469
URL : http://dspace.mit.edu/bitstream/1721.1/1920/1/SWP-0901-02752210.pdf

J. Hunter and B. Nachtergaele, Applied analysis, 2001.
DOI : 10.1142/4319

A. Kroó and D. S. Lubinsky, Christoffel Functions and Universality in the Bulk for Multivariate Orthogonal Polynomials, Journal canadien de mathématiques, vol.65, issue.3, pp.600-620, 2012.
DOI : 10.4153/CJM-2012-016-x

J. Lasserre and E. Pauwels, The empirical Christoffel function in Statistics and Machine Learning, 2017.

P. Ma, M. Mahoney, and B. Yu, A statistical perspective on algorithmic leveraging, The Journal of Machine Learning Research, vol.16, issue.1, pp.861-911, 2015.

M. Mahoney, Randomized Algorithms for Matrices and Data, Machine Learning, pp.123-224, 2011.
DOI : 10.1201/b11822-37

M. Mahoney and P. Drineas, CUR matrix decompositions for improved data analysis, Proceedings of the National Academy of Sciences, pp.697-702, 2009.
DOI : 10.1073/pnas.0500191102
URL : http://www.pnas.org/content/106/3/697.full.pdf

A. Máté and P. Nevai, Bernstein's Inequality in L^p for 0 < p < 1 and (C, 1) Bounds for Orthogonal Polynomials, The Annals of Mathematics, vol.111, issue.1, pp.145-154, 1980.
DOI : 10.2307/1971219

A. Máté, P. Nevai, and V. Totik, Szegő's Extremum Problem on the Unit Circle, The Annals of Mathematics, vol.134, issue.2, pp.433-453, 1991.
DOI : 10.2307/2944352

S. Minsker, On some extensions of Bernstein's inequality for self-adjoint operators, arXiv preprint, 2011.
DOI : 10.1016/j.spl.2017.03.020
URL : http://arxiv.org/pdf/1112.5448

C. Rasmussen and C. Williams, Gaussian Processes in Machine Learning, 2006.
DOI : 10.1162/089976602317250933
URL : http://mlg.eng.cam.ac.uk/pub/pdf/Ras04.pdf

A. Rudi, R. Camoriano, and L. Rosasco, Less is more: Nyström computational regularization, Advances in Neural Information Processing Systems, pp.1657-1665, 2015.

A. Rudi and L. Rosasco, Generalization properties of learning with random features, Advances in Neural Information Processing Systems, pp.3218-3228, 2017.

B. Schölkopf, R. Herbrich, and A. Smola, A Generalized Representer Theorem, International conference on computational learning theory, pp.416-426, 2001.
DOI : 10.1007/3-540-44581-1_27

B. Sriperumbudur, K. Fukumizu, and G. Lanckriet, On the relation between universality, characteristic kernels and RKHS embedding of measures, Thirteenth International Conference on Artificial Intelligence and Statistics, pp.773-780, 2010.

G. Szegö, Orthogonal polynomials, Colloquium publications, 1974.
DOI : 10.1090/coll/023

V. Totik, Asymptotics for Christoffel functions for general measures on the real line, Journal d'Analyse Mathématique, vol.48, issue.1, pp.283-303, 2000.
DOI : 10.1017/CBO9780511759420

P. Velleman and R. Welsch, Efficient computing of regression diagnostics, The American Statistician, pp.234-242, 1981.
DOI : 10.2307/2683296

E. De Vito, L. Rosasco, and A. Toigo, Learning sets with separating kernels, Applied and Computational Harmonic Analysis, vol.37, issue.2, pp.185-217, 2014.
DOI : 10.1016/j.acha.2013.11.003

S. Wang and Z. Zhang, Improving CUR matrix decomposition and the Nyström approximation via adaptive sampling, The Journal of Machine Learning Research, vol.14, issue.1, pp.2729-2769, 2013.

Y. Xu, Asymptotics for orthogonal polynomials and Christoffel functions on a ball, Methods and Applications of Analysis, vol.3, issue.2, pp.257-272, 1996.
DOI : 10.4310/MAA.1996.v3.n2.a6

Y. Xu, Asymptotics of the Christoffel Functions on a Simplex in R^d, Journal of Approximation Theory, vol.99, issue.1, pp.122-133, 1999.
DOI : 10.1006/jath.1998.3312