A generative spiking neural-network model of goal-directed behaviour and one-step planning

Ruggero Basanisi; Andrea Brovelli; Emilio Cartoni; Gianluca Baldassarre

doi:10.1371/journal.pcbi.1007579

Article Dans Une Revue PLoS Computational Biology Année : 2020

A generative spiking neural-network model of goal-directed behaviour and one-step planning

(1) , (1) , , (2)

1
2

Ruggero Basanisi

Fonction : Auteur
PersonId : 800467
ORCID : 0000-0003-4776-596X

Institut de Neurosciences de la Timone

Andrea Brovelli

Fonction : Auteur
PersonId : 184498
IdHAL : andrea-brovelli
ORCID : 0000-0002-5342-1330
IdRef : 204209064

Institut de Neurosciences de la Timone

Emilio Cartoni

Fonction : Auteur

Gianluca Baldassarre

Fonction : Auteur
PersonId : 777919
ORCID : 0000-0002-1277-4447

Istituto di Scienze e Tecnologie della Cognizione

Résumé

Idea of the model, specification of the model and tests, implementation of the model, tests, data analysis, analysis of results, writing-up. ‡Idea of the model, specification of the model and tests, analysis of results, writing-up. ¤Specification of the model and tests, analysis of results, writing-up. * Abstract In mammals, goal-directed and planning processes support flexible behaviour usable to face new situations or changed conditions that cannot be tackled through more efficient but rigid habitual behaviours. Within the Bayesian modelling approach of brain and behaviour, probabilistic models have been proposed to perform planning as a probabilistic inference. Recently, some models have started to face the important challenge met by this approach: grounding such processes on the computations implemented by brain spiking networks. Here we propose a model of goal-directed behaviour that has a probabilistic interpretation and is centred on a recurrent spiking neural network representing the world model. The model, building on previous proposals on spiking neurons and plasticity rules having a probabilistic interpretation, presents these novelties at the system level: (a) the world model is learnt in parallel with its use for planning, and an arbitration mechanism decides when to exploit the world-model knowledge with planning, or to explore, on the basis of an entropy-based confidence on the world model knowledge; (b) the world model is a hidden Markov model (HMM) able to simulate sequences of states and actions, thus planning selects actions through the same neural generative process used to predict states; (c) the world model learns the hidden causes of observations, and their temporal dependencies, through a biologically plausible unsupervised learning mechanism. The model is tested with a visuomotor learning task and validated by comparing its behaviour with the performance and reaction times of human participants solving the same task. The model represents a further step towards the construction of an autonomous architecture bridging goal-directed behaviour as probabilistic inference to brain-like computations. Author summary Goal-directed behaviour relies on brain processes supporting planning of actions based on the prediction of their consequences before performing them in the environment. An important computational modelling approach of these processes sees the brain as a probabilistic machine implementing goal-directed processes relying on probability distributions and operations on them. An important challenge for this approach is to explain how these distributions and operations might be grounded on the brain spiking doi: bioRxiv preprint neurons and learning processes. Here we propose a hypothesis of how this might happen by presenting a computational model of goal-directed processes based on artificial spiking neural networks. The model presents three main novelties. First, it can plan even while it is still learning the consequences of actions by deciding if planning or exploring the environment based on how confident it is on its predictions. Second, it is able to 'think' alternative possible actions, and their consequences, by relying on the low-level stochasticity of neurons. Third, it can learn to anticipate the consequences of actions in an autonomous fashion based on experience. Overall, the model represents a novel hypothesis on how goal-directed behaviour might rely on the stochastic spiking processes and plasticity mechanisms of the brain neurons.

Domaines

Intelligence artificielle [cs.AI] Neurosciences

Fichier principal

Basanisi.pdf (3.49 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Andrea Brovelli : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02565880

Soumis le : mercredi 6 mai 2020-17:03:05

Dernière modification le : mardi 20 juin 2023-19:16:41

Dates et versions

hal-02565880 , version 1 (06-05-2020)

Identifiants

HAL Id : hal-02565880 , version 1
DOI : 10.1371/journal.pcbi.1007579

Citer

Ruggero Basanisi, Andrea Brovelli, Emilio Cartoni, Gianluca Baldassarre. A generative spiking neural-network model of goal-directed behaviour and one-step planning. PLoS Computational Biology, 2020, 16 (12), ⟨10.1371/journal.pcbi.1007579⟩. ⟨hal-02565880⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS UNIV-AMU INT ANR NEUROMARSEILLE

156 Consultations

55 Téléchargements

A generative spiking neural-network model of goal-directed behaviour and one-step planning

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager