A RANDOM MATRIX APPROACH TO NEURAL NETWORKS

Cosme Louart; Zhenyu Liao; Romain Couillet

doi:10.1214/17-AAP1328

Journal article, The Annals of Applied Probability, Year: 2018

Cosme Louart

Role: Author
PersonId : 1040908
IdRef : 270239219

Laboratoire Vision et Ingénierie des Contenus

Zhenyu Liao

Role: Author

Romain Couillet

Role: Author
PersonId : 170874
IdHAL : romain-couillet
ORCID : 0000-0001-5755-2090
IdRef : 15645713X

Laboratoire des signaux et systèmes

Abstract

This article studies the Gram random matrix model $G = \frac{1}{T} \Sigma^{\mathsf{T}} \Sigma$, $\Sigma = \sigma(WX)$, classically found in the analysis of random feature maps and random neural networks, where $X = [x_1, \ldots, x_T] \in \mathbb{R}^{p \times T}$ is a (data) matrix of bounded norm, $W \in \mathbb{R}^{n \times p}$ is a matrix of independent zero-mean unit variance entries, and $\sigma : \mathbb{R} \to \mathbb{R}$ is a Lipschitz continuous (activation) function, $\sigma(WX)$ being understood entry-wise. By means of a key concentration of measure lemma arising from non-asymptotic random matrix arguments, we prove that, as $n, p, T$ grow large at the same rate, the resolvent $Q = (G + \gamma I_T)^{-1}$, for $\gamma > 0$, behaves similarly to the resolvent met in sample covariance matrix models, involving notably the moment $\Phi = \frac{T}{n} \mathbb{E}[G]$, which provides in passing a deterministic equivalent for the empirical spectral measure of $G$. Application-wise, this result enables the estimation of the asymptotic performance of single-layer random neural networks. This in turn provides practical insights into the underlying mechanisms at play in random neural networks, entailing several unexpected consequences, as well as a fast practical means to tune the network hyperparameters.
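As an illustration of the objects defined in the abstract, the following NumPy sketch builds one realization of $\Sigma = \sigma(WX)$, the Gram matrix $G$, and the resolvent $Q$. It is only a numerical sketch, not the authors' code: the function name `gram_and_resolvent`, the Gaussian choice for $W$ and $X$, the `tanh` activation, and the dimensions are all assumptions made for the example.

```python
import numpy as np

def gram_and_resolvent(X, n, gamma, sigma=np.tanh, seed=0):
    """Return G = Sigma^T Sigma / T and Q = (G + gamma I_T)^{-1} for Sigma = sigma(W X).

    Hypothetical helper: W is drawn with i.i.d. standard Gaussian entries
    (zero mean, unit variance); sigma defaults to tanh, which is Lipschitz.
    """
    rng = np.random.default_rng(seed)
    p, T = X.shape
    W = rng.standard_normal((n, p))           # n x p random weights
    Sigma = sigma(W @ X)                      # n x T, activation applied entry-wise
    G = Sigma.T @ Sigma / T                   # T x T Gram matrix
    Q = np.linalg.inv(G + gamma * np.eye(T))  # resolvent, well defined for gamma > 0
    return G, Q

# Assumed toy dimensions, with n, p, T of comparable size.
p, T, n = 64, 128, 256
gamma = 1.0

# Assumed data: Gaussian columns scaled so that X has bounded norm.
X = np.random.default_rng(1).standard_normal((p, T)) / np.sqrt(p)

G, Q = gram_and_resolvent(X, n, gamma)

# Eigenvalues of G define its empirical spectral measure; the normalized
# trace of Q is the Stieltjes-type functional (1/T) * sum_i 1 / (lambda_i + gamma).
eigs = np.linalg.eigvalsh(G)
normalized_trace = np.trace(Q) / T
```

The normalized trace of $Q$ computed this way equals the average of $1/(\lambda_i + \gamma)$ over the eigenvalues $\lambda_i$ of $G$, which is the quantity a deterministic equivalent for the spectral measure would approximate.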

Domains

Machine Learning [stat.ML]; Probability [math.PR]

Main file

1702.05419.pdf (607.65 KB)

Origin: Files produced by the author(s)

https://hal.science/hal-01957656

Submitted on: Monday, 17 December 2018, 14:52:23

Last modified on: Wednesday, 3 April 2024, 11:14:12

Long-term archiving on: Monday, 18 March 2019, 15:07:29

Dates and versions

hal-01957656 , version 1 (17-12-2018)

Identifiers

HAL Id : hal-01957656 , version 1
ARXIV : 1702.05419
DOI : 10.1214/17-AAP1328

Cite

Cosme Louart, Zhenyu Liao, Romain Couillet. A RANDOM MATRIX APPROACH TO NEURAL NETWORKS. The Annals of Applied Probability, 2018, 28 (2), pp.1190-1248. ⟨10.1214/17-AAP1328⟩. ⟨hal-01957656⟩

Collections

CEA CNRS SUP_LSS SUP_SIGNAUX CENTRALESUPELEC DRT CEA-UPSAY UNIV-PARIS-SACLAY LIST GS-ENGINEERING GS-COMPUTER-SCIENCE GS-SPORT-HUMAN-MOVEMENT
