Training recurrent networks online without backtracking

Yann Ollivier; Guillaume Charpiat

Preprint, Working Paper. Year: 2015

Yann Ollivier

Role: Author

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Guillaume Charpiat

Role: Author
PersonId: 6593
IdHAL: guillaume-charpiat
ORCID: 0009-0003-6000-9410
IdRef: 184598230

Machine Learning and Optimisation

Abstract

We introduce the "NoBackTrack" algorithm to train the parameters of dynamical systems such as recurrent neural networks. This algorithm works in an online, memoryless setting, thus requiring no backpropagation through time, and is scalable, avoiding the large computational and memory cost of maintaining the full gradient of the current state with respect to the parameters. The algorithm essentially maintains, at each time, a single search direction in parameter space. The evolution of this search direction is partly stochastic and is constructed so as to provide, at every time, an unbiased random estimate of the gradient of the loss function with respect to the parameters. Because the gradient estimate is unbiased, on average over time the parameter is updated as it should be. The resulting gradient estimate can then be fed to a lightweight Kalman-like filter to yield an improved algorithm. For recurrent neural networks, the resulting algorithms scale linearly with the number of parameters. Preliminary tests on a simple task show that the stochastic approximation of the gradient introduced in the algorithm does not seem to introduce too much noise in the trajectory, compared to maintaining the full gradient, and confirm the good performance and scalability of the Kalman-like version of NoBackTrack.
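
The abstract does not spell out how a single search direction can stand in for the full gradient while remaining unbiased. One standard way to obtain this property, shown here only as an illustrative sketch and not as the authors' stated construction, is to collapse a sum of outer products into one outer product using independent random signs. Cross terms cancel in expectation, so the rank-one estimate has the full sum as its mean. All names below (rank_one_estimate, exact_sum, expected_estimate) and the toy vectors are hypothetical; the sketch checks unbiasedness exactly by enumerating every sign pattern instead of sampling.

```python
from itertools import product


def outer(v, w):
    """Outer product v w^T, returned as a list of rows."""
    return [[vi * wj for wj in w] for vi in v]


def mat_add(a, b):
    """Entrywise sum of two matrices of the same shape."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def rank_one_estimate(vs, ws, signs):
    """Collapse sum_i v_i w_i^T into one outer product using signs s_i in {-1, +1}.

    Assumption: this random-sign reduction is an illustration, not taken from the abstract.
    """
    v_tilde = [sum(s * v[k] for s, v in zip(signs, vs)) for k in range(len(vs[0]))]
    w_tilde = [sum(s * w[k] for s, w in zip(signs, ws)) for k in range(len(ws[0]))]
    return outer(v_tilde, w_tilde)


def exact_sum(vs, ws):
    """The full matrix sum_i v_i w_i^T that the estimate approximates."""
    total = [[0.0] * len(ws[0]) for _ in vs[0]]
    for v, w in zip(vs, ws):
        total = mat_add(total, outer(v, w))
    return total


def expected_estimate(vs, ws):
    """Mean of the rank-one estimate over all 2^n equally likely sign patterns."""
    n = len(vs)
    total = [[0.0] * len(ws[0]) for _ in vs[0]]
    for signs in product((-1, 1), repeat=n):
        total = mat_add(total, rank_one_estimate(vs, ws, signs))
    return [[x / 2 ** n for x in row] for row in total]


# Toy data (hypothetical): three terms, 2-dimensional v_i and 3-dimensional w_i.
vs = [[1.0, 2.0], [0.5, -1.0], [3.0, 0.0]]
ws = [[1.0, 0.0, 2.0], [-1.0, 4.0, 0.5], [2.0, 2.0, 2.0]]

A = exact_sum(vs, ws)               # full sum of outer products
A_mean = expected_estimate(vs, ws)  # exact expectation of the rank-one estimate
one_draw = rank_one_estimate(vs, ws, (1, -1, 1))  # a single rank-one sample
```

Here A_mean agrees with A entry by entry, while one_draw needs only one vector of each size to store. Any single draw is noisy; only the average matches, which is the sense in which an unbiased estimate updates the parameter correctly "on average over time".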

Domains

Probability [math.PR]

https://hal.science/hal-01228954

Submitted on: Monday, February 5, 2018, 22:38:22

Last modified on: Thursday, April 18, 2024, 16:33:54

Dates and versions

hal-01228954, version 1 (05-02-2018)

Identifiers

HAL Id: hal-01228954, version 1
arXiv: 1507.07680

Cite

Yann Ollivier, Guillaume Charpiat. Training recurrent networks online without backtracking. 2015. ⟨hal-01228954⟩

Collections

CNRS INRIA UMR8623 CENTRALESUPELEC INRIA2 LRI-AO UNIV-PARIS-SACLAY GS-COMPUTER-SCIENCE
