Reinforcement learning model in a probabilistically rewarded task
Abstract
Adapting resource-seeking behavior is of primary importance for survival, and balancing exploration with exploitation of discovered resources is at the core of adaptation to the environment. The reinforcement learning framework was elaborated to formalize such reward-seeking behavior, and biologically plausible models based on it have flourished recently. Among them, a neural network model was developed to investigate the functions of the anterior cingulate cortex (ACC) and the dorsolateral prefrontal cortex (DLPFC), involved in action valuation and action selection, respectively (Khamassi et al., 2010). This model proposes a method, inspired by the meta-learning literature (Doya, 2002), to regulate exploration dynamically and thereby solve the exploration/exploitation trade-off online. The model performed well in a deterministic problem-solving task (PST). Our goal was to demonstrate that the model generalizes to a more ecological PST with probabilistically dispensed rewards. The model was tested first with its preset learning rate, exploration rate, and initial action values, and then with parameters obtained by a search of the parameter space. The preset parameter values proved good, though not optimal, for the new task. Interestingly, the model's performance depends strongly on the initial action values.
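The abstract's core ingredients (a learning rate, an exploration rate governing softmax action selection, and initial action values) can be illustrated with a minimal sketch. This is not the authors' ACC/DLPFC network; it is a generic delta-rule value learner on a probabilistic two-armed bandit, with hypothetical parameter names (`alpha`, `beta`, `q_init`) chosen for illustration. Fixed `beta` stands in for the dynamically regulated exploration rate the model implements.

```python
import math
import random

def softmax_choice(values, beta):
    """Softmax action selection: beta is the inverse temperature
    (high beta = exploit, low beta = explore)."""
    exps = [math.exp(beta * v) for v in values]
    total = sum(exps)
    r = random.random()
    cum = 0.0
    for action, e in enumerate(exps):
        cum += e / total
        if r < cum:
            return action
    return len(exps) - 1

def run_bandit(reward_probs, alpha=0.1, beta=3.0, q_init=0.5,
               n_trials=1000, seed=0):
    """Delta-rule value learning on a probabilistic bandit.
    Each arm pays reward 1 with its own probability; q_init sets the
    initial action values, whose influence on performance the abstract
    highlights (optimistic initialization encourages early exploration)."""
    random.seed(seed)
    q = [q_init] * len(reward_probs)
    total_reward = 0.0
    for _ in range(n_trials):
        a = softmax_choice(q, beta)
        r = 1.0 if random.random() < reward_probs[a] else 0.0
        q[a] += alpha * (r - q[a])  # prediction-error update
        total_reward += r
    return q, total_reward
```

With enough trials the learned values track the reward probabilities, so the agent comes to prefer the richer arm; sweeping `alpha`, `beta`, and `q_init` is the kind of parameter-space search the abstract describes.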
Main file: NEUROCOMP2010_0046_3d394d938a755edf24c9a86282b91b2f.pdf (374 KB)
Origin: Files produced by the author(s)