C. Amiez, J. Joseph, and E. Procyk, Anterior cingulate error-related activity is modulated by predicted reward, European Journal of Neuroscience, vol.21, issue.12, pp.3447-3452, 2005.
DOI : 10.1111/j.1460-9568.2005.04170.x

URL : https://hal.archives-ouvertes.fr/inserm-00132130

C. Amiez, J. Joseph, and E. Procyk, Reward Encoding in the Monkey Anterior Cingulate Cortex, Cerebral Cortex, vol.16, issue.7, pp.1040-1055, 2006.
DOI : 10.1093/cercor/bhj046

URL : https://hal.archives-ouvertes.fr/inserm-00132137

M. Arbib, G. Metta, and P. van der Smagt, Neurorobotics: From Vision to Action, Handbook of Robotics, pp.1453-1480, 2008.
DOI : 10.1007/978-3-540-30301-5_63

G. Aston-Jones and J. D. Cohen, Adaptive gain and the role of the locus coeruleus-norepinephrine system in optimal performance, The Journal of Comparative Neurology, vol.493, issue.1, pp.99-110, 2005.
DOI : 10.1002/cne.20723

P. Auer, N. Cesa-Bianchi, and P. Fischer, Finite-time analysis of the multiarmed bandit, Machine Learning, vol.47, issue.2/3, pp.235-256, 2002.
DOI : 10.1023/A:1013689704352

C. W. Berridge and B. D. Waterhouse, The locus coeruleus–noradrenergic system: modulation of behavioral state and state-dependent cognitive processes, Brain Research Reviews, vol.42, issue.1, pp.33-84, 2003.
DOI : 10.1016/S0165-0173(03)00143-7

M. M. Botvinick, T. S. Braver, D. M. Barch, C. S. Carter, and J. D. Cohen, Conflict monitoring and cognitive control, Psychological Review, vol.108, issue.3, pp.624-652, 2001.
DOI : 10.1037/0033-295X.108.3.624

J. W. Brown and T. S. Braver, Learned Predictions of Error Likelihood in the Anterior Cingulate Cortex, Science, vol.307, issue.5712, pp.1118-1121, 2005.
DOI : 10.1126/science.1105783

J. D. Cohen, G. Aston-Jones, and M. S. Gilzenrat, A systems-level perspective on attention and cognitive control, in Cognitive Neuroscience of Attention, ed. M. Posner, pp.71-90, 2004.

J. D. Cohen, S. M. McClure, and A. J. Yu, Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration, Philosophical Transactions of the Royal Society B: Biological Sciences, vol.362, issue.1481, pp.933-942, 2007.
DOI : 10.1098/rstb.2007.2098

A. Delorme, J. Gautrais, R. VanRullen, and S. Thorpe, SpikeNET: A simulator for modeling large networks of integrate and fire neurons, Neurocomputing, vol.26-27, pp.989-996, 1999.
DOI : 10.1016/S0925-2312(99)00095-8

P. F. Dominey, M. Arbib, and J. P. Joseph, A Model of Corticostriatal Plasticity for Learning Oculomotor Associations and Sequences, Journal of Cognitive Neuroscience, vol.7, issue.3, pp.311-336, 1995.
DOI : 10.1162/jocn.1995.7.3.311

P. F. Dominey, A. Mallet, and E. Yoshida, Real-Time Spoken-Language Programming for Cooperative Interaction with a Humanoid Apprentice, International Journal of Humanoid Robotics, vol.06, issue.02, pp.147-171, 2009.
DOI : 10.1142/S0219843609001711

K. Doya, Metalearning and neuromodulation, Neural Networks, vol.15, issue.4-6, pp.495-506, 2002.
DOI : 10.1016/S0893-6080(02)00044-8

K. Fuxe, T. Hökfelt, O. Johansson, G. Jonsson, P. Lidbrink et al., The origin of the dopamine nerve terminals in limbic and frontal cortex. Evidence for mesocortico dopamine neurons, Brain Research, vol.82, pp.349-355, 1974.

M. J. Frank, B. B. Doll, J. Oas-Terpstra, and F. Moreno, Prefrontal and striatal dopaminergic genes predict individual differences in exploration and exploitation, Nature Neuroscience, vol.12, issue.8, pp.1062-1068, 2009.
DOI : 10.1038/nn.2342

J. C. Horvitz, Mesolimbocortical and nigrostriatal dopamine responses to salient non-reward events, Neuroscience, vol.96, issue.4, pp.651-656, 2000.
DOI : 10.1016/S0306-4522(00)00019-1

M. D. Humphries, R. D. Stewart, and K. N. Gurney, A Physiologically Plausible Model of Action Selection and Oscillatory Activity in the Basal Ganglia, Journal of Neuroscience, vol.26, issue.50, pp.12921-12942, 2006.
DOI : 10.1523/JNEUROSCI.3486-06.2006

S. W. Kennerley, M. E. Walton, T. E. Behrens, M. J. Buckley, and M. F. Rushworth, Optimal decision making and the anterior cingulate cortex, Nature Neuroscience, vol.9, issue.7, pp.940-947, 2006.
DOI : 10.1038/nn1724

M. Khamassi, L. Martinet, and A. Guillot, Combining self-organizing maps with mixtures of experts: application to an actor-critic model of reinforcement learning in the basal ganglia, From Animals to Animats 9: Proceedings of the Ninth International Conference on Simulation of Adaptive Behavior (SAB), pp.394-405, 2006.
URL : https://hal.archives-ouvertes.fr/hal-00688933

M. Khamassi, R. Quilodran, P. Enel, E. Procyk, and P. F. Dominey, A Computational Model of Integration between Reinforcement Learning and Task Monitoring in the Prefrontal Cortex, From Animals to Animats 11: Proceedings of the Eleventh International Conference on Simulation of Adaptive Behavior (SAB), pp.424-434, 2010.
DOI : 10.1007/978-3-642-15193-4_40

URL : https://hal.archives-ouvertes.fr/inserm-00548868

M. Khamassi, C. Wilson, M. Rothé, R. Quilodran, P. F. Dominey et al., Metalearning, cognitive control, and physiological interactions between medial and lateral prefrontal cortex, in Neural Basis of Motivational and Cognitive Control, ed. R. B. Mars, J. Sallet, M. Rushworth, and N. Yeung.
DOI : 10.7551/mitpress/9780262016438.003.0019

E. Koechlin and C. Summerfield, An information theoretical approach to prefrontal executive function, Trends in Cognitive Sciences, vol.11, issue.6, pp.229-235, 2007.
DOI : 10.1016/j.tics.2007.04.005

F. Kouneiher, S. Charron, and E. Koechlin, Motivation and cognitive control in the human prefrontal cortex, Nature Neuroscience, vol.12, issue.7, pp.939-945, 2009.
DOI : 10.1038/nn.2321

J. L. Krichmar, The Neuromodulatory System: A Framework for Survival and Adaptive Behavior in a Challenging World, Adaptive Behavior, vol.16, issue.6, pp.385-399, 2008.
DOI : 10.1177/1059712308095775

S. Lallée, C. Madden, M. Hoen, and P. F. Dominey, Linking language with embodied and teleological representations of action for humanoid cognition, Frontiers in Neurorobotics, 2010.
DOI : 10.3389/fnbot.2010.00008

D. Lee, M. F. Rushworth, M. E. Walton, M. Watanabe, and M. Sakagami, Functional Specialization of the Primate Frontal Cortex during Decision Making, Journal of Neuroscience, vol.27, issue.31, pp.8170-8173, 2007.
DOI : 10.1523/JNEUROSCI.1561-07.2007

M. Matsumoto, K. Matsumoto, H. Abe, and K. Tanaka, Medial prefrontal cell activity signaling prediction errors of action values, Nature Neuroscience, vol.10, issue.5, pp.647-656, 2007.
DOI : 10.1038/nn1890

S. M. McClure, M. S. Gilzenrat, and J. D. Cohen, An exploration–exploitation model based on norepinephrine and dopamine activity, in Advances in Neural Information Processing Systems (NIPS), ed. Y. Weiss, B. Schölkopf et al., pp.867-874, 2006.

G. Metta, P. Fitzpatrick, and L. Natale, YARP: Yet Another Robot Platform, International Journal of Advanced Robotic Systems, vol.3, issue.1, pp.43-48, 2006.
DOI : 10.5772/5761

URL : http://doi.org/10.5772/5761

J. Meyer and A. Guillot, Biologically Inspired Robots, Handbook of Robotics, pp.1395-1422, 2008.
DOI : 10.1007/978-3-540-30301-5_61

R. Pfeifer, M. Lungarella, and F. Iida, Self-Organization, Embodiment, and Biologically Inspired Robotics, Science, vol.318, issue.5853, pp.1088-1093, 2007.
DOI : 10.1126/science.1145803

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.377.1611

E. Procyk and P. S. Goldman-Rakic, Modulation of Dorsolateral Prefrontal Delay Activity during Self-Organized Behavior, Journal of Neuroscience, vol.26, issue.44, pp.11313-11323, 2006.
DOI : 10.1523/JNEUROSCI.2157-06.2006

URL : https://hal.archives-ouvertes.fr/inserm-00132158

E. Procyk, Y. L. Tanaka, and J. P. Joseph, Anterior cingulate activity during routine and non-routine sequential behaviors in macaques, Nature Neuroscience, vol.3, issue.5, pp.502-508, 2000.
DOI : 10.1038/74880

URL : https://hal.archives-ouvertes.fr/inserm-00132133

R. Quilodran, M. Rothé, and E. Procyk, Behavioral Shifts and Action Valuation in the Anterior Cingulate Cortex, Neuron, vol.57, issue.2, pp.314-325, 2008.
DOI : 10.1016/j.neuron.2007.11.031

URL : https://hal.archives-ouvertes.fr/inserm-00906686

P. Redgrave and K. N. Gurney, The short-latency dopamine signal: a role in discovering novel actions?, Nature Reviews Neuroscience, vol.7, issue.12, pp.967-975, 2006.
DOI : 10.1038/nrn2022

P. Redgrave, K. N. Gurney, and J. Reynolds, What is reinforced by phasic dopamine signals?, Brain Research Reviews, vol.58, issue.2, pp.322-339, 2008.
DOI : 10.1016/j.brainresrev.2007.10.007

M. F. Rushworth and T. E. Behrens, Choice, uncertainty and value in prefrontal and cingulate cortex, Nature Neuroscience, vol.11, issue.4, pp.389-397, 2008.
DOI : 10.1038/nn2066

M. F. Rushworth, T. E. Behrens, P. H. Rudebeck, and M. E. Walton, Contrasting roles for cingulate and orbitofrontal cortex in decisions and social behaviour, Trends in Cognitive Sciences, vol.11, issue.4, pp.168-176, 2007.
DOI : 10.1016/j.tics.2007.01.004

J. Sallet, R. Quilodran, M. Rothé, J. Vezoli, J. P. Joseph et al., Expectations, gains, and losses in the anterior cingulate cortex, Cognitive, Affective, & Behavioral Neuroscience, vol.7, issue.4, pp.327-336, 2007.
DOI : 10.3758/CABN.7.4.327

URL : https://hal.archives-ouvertes.fr/inserm-00256218

W. Schultz, P. Dayan, and P. R. Montague, A Neural Substrate of Prediction and Reward, Science, vol.275, issue.5306, pp.1593-1599, 1997.
DOI : 10.1126/science.275.5306.1593

H. Seo and D. Lee, Temporal Filtering of Reward Signals in the Dorsal Anterior Cingulate Cortex during a Mixed-Strategy Game, Journal of Neuroscience, vol.27, issue.31, pp.8366-8377, 2007.
DOI : 10.1523/JNEUROSCI.2369-07.2007

R. Sutton and A. Barto, Reinforcement Learning: An Introduction, 1998.

N. G. Tsagarakis, G. Metta, G. Sandini et al., iCub: the design and realization of an open humanoid platform for cognitive and neuroscience research, Advanced Robotics, vol.21, pp.1151-1175, 2007.

A. Weitzenfeld, M. A. Arbib, and A. Alexander, The Neural Simulation Language: A System for Brain Modeling, 2002.

A. J. Yu and P. Dayan, Uncertainty, Neuromodulation, and Attention, Neuron, vol.46, issue.4, pp.681-692, 2005.
DOI : 10.1016/j.neuron.2005.04.026

URL : http://doi.org/10.1016/j.neuron.2005.04.026