Riemannian metrics for neural networks I: Feedforward networks

Yann Ollivier

doi:10.1093/imaiai/iav006

Article Dans Une Revue Information and Inference Année : 2015

Riemannian metrics for neural networks I: Feedforward networks

(1, 2)

1
2

Yann Ollivier

Fonction : Auteur

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Résumé

We describe four algorithms for neural network training, each adapted to different scalability constraints. These algorithms are mathematically principled and invariant under a number of transformations in data and network representation, from which performance is thus independent. These algorithms are obtained from the setting of differential geometry, and are based on either the natural gradient using the Fisher information matrix, or on Hessian methods, scaled down in a specific way to allow for scalability while keeping some of their key mathematical properties.

Domaines

Réseau de neurones [cs.NE] Apprentissage [cs.LG] Théorie de l'information et codage [math.IT] Théorie de l'information [cs.IT] Géométrie différentielle [math.DG]

Yann Ollivier : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00857982

Soumis le : mercredi 4 septembre 2013-13:33:38

Dernière modification le : jeudi 18 avril 2024-16:28:58

Dates et versions

hal-00857982 , version 1 (04-09-2013)

Identifiants

HAL Id : hal-00857982 , version 1
ARXIV : 1303.0818
DOI : 10.1093/imaiai/iav006

Citer

Yann Ollivier. Riemannian metrics for neural networks I: Feedforward networks. Information and Inference, 2015, 4 (2), pp.108-153. ⟨10.1093/imaiai/iav006⟩. ⟨hal-00857982⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UMR8623 CENTRALESUPELEC INRIA2 LRI-AO UNIV-PARIS-SACLAY GS-COMPUTER-SCIENCE

321 Consultations

0 Téléchargements

Riemannian metrics for neural networks I: Feedforward networks

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager