N. I. Akhiezer and I. M. Glazman, Theory of linear operators in Hilbert space, 1993.

Z. D. Bai and J. W. Silverstein, No eigenvalues outside the support of the limiting spectral distribution of large dimensional sample covariance matrices, The Annals of Probability, vol.26, pp.316-345, 1998.

Z. D. Bai and J. W. Silverstein, On the signal-to-interference-ratio of CDMA systems in wireless communications, Annals of Applied Probability, vol.17, pp.81-101, 2007.

Z. D. Bai and J. W. Silverstein, Spectral analysis of large dimensional random matrices, 2009.

F. Benaych-Georges and R. R. Nadakuditi, The singular values and vectors of low rank perturbations of large rectangular random matrices, Journal of Multivariate Analysis, vol.111, pp.120-135, 2012.

E. Cambria, P. Gastaldo, F. Bisio, and R. Zunino, An ELM-based model for affective analogical reasoning, Neurocomputing, vol.149, pp.443-455, 2015.

A. Choromanska, M. Henaff, M. Mathieu, G. Ben Arous, and Y. LeCun, The Loss Surfaces of Multilayer Networks, AISTATS, 2015.

R. Couillet and F. Benaych-Georges, Kernel spectral clustering of large dimensional data, Electronic Journal of Statistics, vol.10, pp.1393-1454, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01215343

R. Couillet and A. Kammoun, Random Matrix Improved Subspace Clustering, 2016 Asilomar Conference on Signals, Systems, and Computers, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01633444

R. Couillet, F. Pascal, and J. W. Silverstein, The random matrix regime of Maronna's M-estimator with elliptically distributed samples, Journal of Multivariate Analysis, vol.139, pp.56-78, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01242488

R. Giryes, G. Sapiro, and A. M. Bronstein, Deep Neural Networks with Random Gaussian Weights: A Universal Classification Strategy?, IEEE Transactions on Signal Processing, vol.64, pp.3444-3457, 2015.

K. Hornik, M. Stinchcombe, and H. White, Multilayer feedforward networks are universal approximators, Neural Networks, vol.2, pp.359-366, 1989.

J. Hoydis, R. Couillet, and M. Debbah, Random beamforming over quasi-static and fading channels: a deterministic equivalent approach, IEEE Transactions on Information Theory, vol.58, pp.6392-6425, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00769412

G. Huang, Q. Zhu, and C. Siew, Extreme learning machine: theory and applications, Neurocomputing, vol.70, pp.489-501, 2006.

G. Huang, H. Zhou, X. Ding, and R. Zhang, Extreme learning machine for regression and multiclass classification, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, vol.42, pp.513-529, 2012.

H. Jaeger and H. Haas, Harnessing nonlinearity: Predicting chaotic systems and saving energy in wireless communication, Science, vol.304, pp.78-80, 2004.

A. Kammoun, M. Kharouf, W. Hachem, and J. Najim, A central limit theorem for the SINR at the LMMSE estimator output for large-dimensional signals, IEEE Transactions on Information Theory, vol.55, pp.5048-5063, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00328163

N. El Karoui, Concentration of measure and spectra of random matrices: applications to correlation matrices, elliptical distributions and beyond, The Annals of Applied Probability, vol.19, pp.2362-2405, 2009.

N. El Karoui, The spectrum of kernel random matrices, The Annals of Statistics, vol.38, pp.1-50, 2010.

N. El Karoui, Asymptotic behavior of unregularized and ridge-regularized high-dimensional robust regression estimators: rigorous results, 2013.

A. Krizhevsky, I. Sutskever, and G. E. Hinton, ImageNet classification with deep convolutional neural networks, Advances in Neural Information Processing Systems, pp.1097-1105, 2012.

Y. LeCun, C. Cortes, and C. Burges, The MNIST database of handwritten digits, 1998.

M. Ledoux, The concentration of measure phenomenon, vol.89, 2005.

P. Loubaton and P. Vallet, Almost sure localization of the eigenvalues in a Gaussian information plus noise model. Application to the spiked models, Electronic Journal of Probability, vol.16, pp.1934-1959, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00692258

X. Mai and R. Couillet, The counterintuitive mechanism of graph-based semisupervised learning in the big data regime, IEEE International Conference on Acoustics, Speech and Signal Processing, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01957754

V. A. Marčenko and L. A. Pastur, Distribution of eigenvalues for some sets of random matrices, Math USSR-Sbornik, vol.1, pp.457-483, 1967.

L. Pastur and M. Shcherbina, Eigenvalue distribution of large random matrices, 2011.

A. Rahimi and B. Recht, Random features for large-scale kernel machines, Advances in Neural Information Processing Systems, pp.1177-1184, 2007.

F. Rosenblatt, The perceptron: a probabilistic model for information storage and organization in the brain, Psychological Review, vol.65, p.386, 1958.

M. Rudelson and R. Vershynin, Hanson-Wright inequality and sub-Gaussian concentration, Electronic Communications in Probability, vol.18, pp.1-9, 2013.

A. Saxe, P. W. Koh, Z. Chen, M. Bhand, B. Suresh et al., On random weights and unsupervised feature learning, Proceedings of the 28th International Conference on Machine Learning, pp.1089-1096, 2011.

J. Schmidhuber, Deep learning in neural networks: An overview, Neural Networks, vol.61, pp.85-117, 2015.

J. W. Silverstein and Z. D. Bai, On the empirical distribution of eigenvalues of a class of large dimensional random matrices, Journal of Multivariate Analysis, vol.54, pp.175-192, 1995.

J. W. Silverstein and S. Choi, Analysis of the limiting spectral distribution of large dimensional random matrices, Journal of Multivariate Analysis, vol.54, pp.295-309, 1995.

T. Tao, Topics in random matrix theory, vol.132, 2012.

E. C. Titchmarsh, The Theory of Functions, 1939.

R. Vershynin, Introduction to the non-asymptotic analysis of random matrices, Compressed Sensing, pp.210-268, 2012.

C. K. Williams, Computation with infinite neural networks, Neural Computation, vol.10, pp.1203-1216, 1998.

R. D. Yates, A framework for uplink power control in cellular radio systems, IEEE Journal on Selected Areas in Communications, vol.13, pp.1341-1347, 1995.

T. Zhang, X. Cheng, and A. Singer, Marchenko-Pastur Law for Tyler's and Maronna's M-estimators, 2014.

Z. Liao and R. Couillet, A Large Dimensional Analysis of Least Squares Support Vector Machines, Journal of Machine Learning Research, 2017.
URL : https://hal.archives-ouvertes.fr/hal-02048984