Learning and forgetting using reinforced Bayesian change detection

Vincent Moens; Alexandre Zénon

doi:10.1371/journal.pcbi.1006713

Journal article, PLoS Computational Biology, Year: 2019

Vincent Moens

Role: Author

Alexandre Zénon

Role: Author
PersonId : 741769
IdHAL : alexandre-zenon
ORCID : 0000-0001-7989-1261
IdRef : 227531558

Université Catholique de Louvain = Catholic University of Louvain

Abstract

Agents living in volatile environments must be able to detect changes in contingencies while refraining from adapting to unexpected events that are caused by noise. In Reinforcement Learning (RL) frameworks, this requires learning rates that adapt to the past reliability of the model. The observation that behavioural flexibility in animals tends to decrease following prolonged training in a stable environment provides experimental evidence for such adaptive learning rates. However, in classical RL models, the learning rate is either fixed or scheduled and thus cannot adapt dynamically to environmental changes. Here, we propose a new Bayesian learning model, using variational inference, that achieves adaptive change detection through Stabilized Forgetting: it updates its current belief based on a mixture of fixed, initial priors and previous posterior beliefs. The weight given to these two sources is optimized alongside the other parameters, allowing the model to adapt dynamically to changes in environmental volatility and to unexpected observations. This approach is used to implement the "critic" of an actor-critic RL model, while the actor samples the resulting value distributions to choose which action to undertake. We show that our model can emulate different strategies for adapting to contingency changes, depending on its prior assumptions about environmental stability, and that model parameters can be fit to real data with high accuracy. The model also exhibits trade-offs between flexibility and computational costs that mirror those observed in real data. Overall, the proposed method provides a general framework to study learning flexibility and decision making in RL contexts.
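
The mixing of a fixed initial prior with the previous posterior can be illustrated with a deliberately simplified sketch. It is not the authors' model: it assumes Bernoulli rewards with conjugate Beta beliefs instead of variational inference, and it keeps the mixture weight `w` fixed, whereas the abstract states that this weight is optimized alongside the other parameters. The function names (`forget`, `update`, `run_bandit`) and the choice to apply forgetting to every arm on every trial are hypothetical, chosen for illustration only. For a Beta belief, the geometric mixture of posterior and prior reduces to a linear mix of the Beta parameters, and the "actor" is approximated here by sampling each arm's value distribution and choosing the largest sample.

```python
import random


def forget(posterior, prior0, w):
    """Stabilized forgetting for a Beta belief (illustrative assumption).

    The new prior is proportional to posterior**w * prior0**(1 - w), which
    for Beta densities is a linear mix of the (alpha, beta) parameters.
    w = 1 keeps the posterior unchanged; w = 0 resets to the initial prior.
    """
    a, b = posterior
    a0, b0 = prior0
    return (w * a + (1 - w) * a0, w * b + (1 - w) * b0)


def update(belief, reward):
    """Conjugate Beta-Bernoulli update with a binary reward (0 or 1)."""
    a, b = belief
    return (a + reward, b + (1 - reward))


def run_bandit(reward_probs_by_trial, w=0.9, prior0=(1.0, 1.0), seed=0):
    """Run a toy bandit; reward_probs_by_trial[t][k] is arm k's reward
    probability on trial t (hypothetical input format)."""
    rng = random.Random(seed)
    n_arms = len(reward_probs_by_trial[0])
    beliefs = [prior0] * n_arms
    choices = []
    for probs in reward_probs_by_trial:
        # "Actor": draw one sample per arm from its value distribution
        # and pick the arm with the largest sample.
        samples = [rng.betavariate(a, b) for a, b in beliefs]
        arm = max(range(n_arms), key=samples.__getitem__)
        reward = 1 if rng.random() < probs[arm] else 0
        # "Critic": pull every belief back toward the initial prior,
        # then update only the chosen arm with the observed reward.
        beliefs = [forget(belief, prior0, w) for belief in beliefs]
        beliefs[arm] = update(beliefs[arm], reward)
        choices.append(arm)
    return choices, beliefs


if __name__ == "__main__":
    # Contingencies reverse halfway through the session.
    schedule = [(0.8, 0.2)] * 100 + [(0.2, 0.8)] * 100
    choices, beliefs = run_bandit(schedule, w=0.9)
    print("arm 1 chosen in last 50 trials:", sum(choices[-50:]))
    print("final beliefs:", beliefs)
```

In this sketch, a weight below 1 caps the effective number of remembered observations: the total pseudo-count of an arm cannot exceed the prior's pseudo-count plus 1 / (1 - w). That cap is what keeps the toy agent able to respond after a reversal, at the cost of noisier estimates in a stable environment.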

Domains

Humanities and Social Sciences; Life Sciences [q-bio]

Main file

pcbi.1006713.pdf (4.07 MB)

Origin: Files produced by the author(s)

https://hal.science/hal-02407890

Submitted on: Monday, December 21, 2020, 08:34:55

Last modified on: Wednesday, November 3, 2021, 05:13:58

Dates and versions

hal-02407890 , version 1 (21-12-2020)

Identifiers

HAL Id : hal-02407890 , version 1
DOI : 10.1371/journal.pcbi.1006713

Cite

Vincent Moens, Alexandre Zénon. Learning and forgetting using reinforced Bayesian change detection. PLoS Computational Biology, 2019, 15 (4), pp.e1006713. ⟨10.1371/journal.pcbi.1006713⟩. ⟨hal-02407890⟩
