G. E. Alexander and M. D. Crutcher, Functional architecture of basal ganglia circuits: Neural substrates of parallel processing, Trends in Neurosciences, 1990.

G. E. Alexander, M. D. Crutcher, and M. R. DeLong, Basal ganglia-thalamocortical circuits: Parallel substrates for motor, oculomotor, prefrontal and limbic functions, Progress in Brain Research, pp.119-146, 1991.

G. E. Alexander, M. R. DeLong, and P. L. Strick, Parallel organization of functionally segregated circuits linking basal ganglia and cortex, Annual Review of Neuroscience, vol.9, issue.1, pp.357-381, 1986.

S. Amari, Dynamics of pattern formation in lateral-inhibition type neural fields, Biological Cybernetics, vol.27, issue.2, pp.77-87, 1977.

F. G. Ashby, B. O. Turner, and J. C. Horvitz, Cortical and basal ganglia contributions to habit learning and automaticity, Trends in Cognitive Sciences, vol.14, issue.5, pp.208-215, 2010.

P. Auer, N. Cesa-Bianchi, Y. Freund, and R. E. Schapire, The nonstochastic multiarmed bandit problem, SIAM Journal on Computing, vol.32, issue.1, pp.48-77, 2002.

I. Bar-Gad and H. Bergman, Stepping out of the box: Information processing in the neural networks of the basal ganglia, Current Opinion in Neurobiology, vol.11, issue.6, pp.689-695, 2001.

M. F. Bear and R. C. Malenka, Synaptic plasticity: LTP and LTD, Current Opinion in Neurobiology, vol.4, issue.3, pp.389-399, 1994.

C. M. Bradshaw, E. Szabadi, P. Bevan, and H. V. Ruddle, The effect of signaled reinforcement availability on concurrent performances in humans, Journal of the Experimental Analysis of Behavior, vol.32, issue.1, pp.65-74, 1979.

J. W. Brown, D. Bullock, and S. Grossberg, How laminar frontal cortex and basal ganglia circuits interact to control planned and reactive saccades, Neural Networks, vol.17, issue.4, pp.471-510, 2004.

L. L. Brown, D. M. Smith, and L. M. Goldbloom, Organizing principles of cortical integration in the rat neostriatum: Corticostriate map of the body surface is an ordered lattice of curved laminae and radial points, The Journal of Comparative Neurology, vol.392, issue.4, pp.468-488, 1998.

E. M. Callaway, Local circuits in primary visual cortex of the macaque monkey, Annual Review of Neuroscience, vol.21, issue.1, pp.47-74, 1998.

N. Caporale and Y. Dan, Spike timing-dependent plasticity: A Hebbian learning rule, Annual Review of Neuroscience, vol.31, issue.1, pp.25-46, 2008.

J. D. Charlesworth, T. L. Warren, and M. S. Brainard, Covert skill learning in a cortical-basal ganglia circuit, Nature, vol.486, issue.7402, pp.251-255, 2012.

R. Coultrip, R. Granger, and G. Lynch, A cortical model of winner-take-all competition via lateral inhibition, Neural Networks, vol.5, issue.1, pp.47-54, 1992.

R. L. Cowan and C. J. Wilson, Spontaneous firing patterns and axonal projections of single corticostriatal neurons in the rat medial agranular cortex, Journal of Neurophysiology, vol.71, issue.1, pp.17-32, 1994.

G. Deco, A. Ponce-Alvarez, P. Hagmann, G. L. Romani, D. Mantini et al., How local excitation-inhibition ratio impacts the whole brain dynamics, Journal of Neuroscience, vol.34, issue.23, pp.7886-7898, 2014.

M. Desmurget and R. S. Turner, Motor sequences and the basal ganglia: Kinematics, not habits, Journal of Neuroscience, vol.30, issue.22, pp.7685-7690, 2010.

A. Dezfouli and B. W. Balleine, Actions, action sequences and habits: Evidence that goal-directed and habitual action control are hierarchically organized, PLoS Computational Biology, vol.9, issue.12, 2013.

B. B. Doll, D. A. Simon, and N. D. Daw, The ubiquity of model-based reinforcement learning, Current Opinion in Neurobiology, vol.22, issue.6, pp.1075-1081, 2012.

J. D. Dougan, F. K. McSweeney, and V. A. Farmer, Some parameters of behavioral contrast and allocation of interim behavior in rats, Journal of the Experimental Analysis of Behavior, vol.44, issue.3, pp.44-325, 1985.

K. Doya, Complementary roles of basal ganglia and cerebellum in learning and motor control, Current Opinion in Neurobiology, vol.10, issue.6, pp.732-739, 2000.

K. Doya, Reinforcement learning: Computational theory and biological mechanisms, HFSP Journal, vol.1, issue.1, pp.30-41, 2007.

D. E. Feldman, Synaptic mechanisms for plasticity in neocortex, Annual Review of Neuroscience, vol.32, issue.1, pp.33-55, 2009.

A. W. Flaherty and A. M. Graybiel, Corticostriatal transformations in the primate somatosensory system. Projections from physiologically mapped body-part representations, Journal of Neurophysiology, vol.66, issue.4, pp.1249-1263, 1991.

M. J. Frank, By carrot or by stick: Cognitive reinforcement learning in parkinsonism, Science, vol.306, issue.5703, pp.1940-1943, 2004.

L. B. Gilbert-Norton, T. A. Shahan, and J. A. Shivik, Coyotes (Canis latrans) and the matching law, Behavioural Processes, vol.82, pp.178-183, 2009.

J. C. Gittins, Bandit processes and dynamic allocation indices, Journal of the Royal Statistical Society. Series B (Methodological), vol.41, issue.2, pp.148-177, 1979.

P. W. Glimcher, Understanding dopamine and reinforcement learning: the dopamine reward prediction error hypothesis, Proceedings of the National Academy of Sciences, vol.108, issue.3, pp.15647-15654, 2011.

D. A. Graft, S. E. Lea, and T. L. Whitworth, The matching law in and within groups of rats, Journal of the Experimental Analysis of Behavior, vol.27, issue.1, pp.183-194, 1977.

A. M. Graybiel, Habits, rituals, and the evaluative brain, Annual Review of Neuroscience, vol.31, issue.1, pp.359-387, 2008.

A. M. Graybiel, T. Aosaki, A. W. Flaherty, and M. Kimura, The basal ganglia and adaptive motor control, Science, vol.265, issue.5180, pp.1826-1831, 1994.

K. Gurney, T. J. Prescott, and P. Redgrave, A computational model of action selection in the basal ganglia. II. Analysis and simulation of behaviour, Biological Cybernetics, vol.84, issue.6, pp.411-423, 2001.

M. Guthrie, A. Leblois, A. Garenne, and T. Boraud, Interaction between cognitive and motor cortico-basal ganglia loops during decision making: A computational study, Journal of Neurophysiology, vol.109, pp.3025-3040, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00828004

S. N. Haber, The primate basal ganglia: Parallel and integrative networks, Journal of Chemical Neuroanatomy, vol.26, issue.4, 2003.

S. Hélie, S. W. Ell, and F. G. Ashby, Learning robust cortico-cortical associations with the basal ganglia: An integrative review, Cortex, vol.64, pp.123-135, 2015.

R. J. Herrnstein, Formal properties of the matching law, Journal of the Experimental Analysis of Behavior, vol.21, issue.1, pp.159-164, 1974.

R. J. Herrnstein, W. Vaughan, D. B. Mumford, and S. M. Kosslyn, Teaching pigeons an abstract relational rule: Insideness, Perception & Psychophysics, vol.46, issue.1, pp.56-64, 1989.

N. Hiratani and T. Fukai, Hebbian wiring plasticity generates efficient network structures for robust inference with synaptic weight plasticity, Frontiers in Neural Circuits, p.10, 2016.

J. J. Hopfield, Neurons with graded response have collective computational properties like those of two-state neurons, Proceedings of the National Academy of Sciences, vol.81, issue.10, pp.3088-3092, 1984.

D. Kase and T. Boraud, Covert learning in the basal ganglia: Raw data, FigShare, 2017.

M. N. Katehakis and A. F. Veinott, The multi-armed bandit problem: Decomposition and computation, Mathematics of Operations Research, vol.12, issue.2, pp.262-268, 1987.
DOI : 10.1287/moor.12.2.262

T. Keasar, Bees in two-armed bandit situations: Foraging choices and possible decision mechanisms, Behavioral Ecology, vol.13, issue.6, pp.757-765, 2002.
DOI : 10.1093/beheco/13.6.757
URL : https://academic.oup.com/beheco/article-pdf/13/6/757/9731751/bhec-13-06-757.pdf

J. N. Kerr and J. R. Wickens, Dopamine D-1/D-5 receptor activation is required for long-term potentiation in the rat neostriatum in vitro, Journal of Neurophysiology, vol.85, issue.1, pp.117-124, 2001.

A. E. Kincaid, T. Zheng, and C. J. Wilson, Connectivity and convergence of single corticostriatal axons, Journal of Neuroscience, vol.18, issue.12, pp.4722-4731, 1998.
DOI : 10.1523/jneurosci.18-12-04722.1998
URL : http://www.jneurosci.org/content/18/12/4722.full.pdf

B. Lau and P. W. Glimcher, Dynamic response-by-response models of matching behavior in rhesus monkeys, Journal of the Experimental Analysis of Behavior, vol.84, issue.3, pp.555-579, 2005.

B. Lau and P. W. Glimcher, Value representations in the primate striatum during matching behavior, Neuron, vol.58, issue.3, pp.451-463, 2008.
DOI : 10.1016/j.neuron.2008.02.021
URL : https://doi.org/10.1016/j.neuron.2008.02.021

A. Leblois, T. Boraud, W. Meissner, H. Bergman, and D. Hansel, Competition between feedback loops underlies normal and pathological dynamics in the basal ganglia, Journal of Neuroscience, vol.26, pp.3567-3583, 2006.
DOI : 10.1523/jneurosci.5050-05.2006
URL : https://hal.archives-ouvertes.fr/hal-00173766

L. R. Matthews and W. Temple, Concurrent schedule assessment of food preference in cows, Journal of the Experimental Analysis of Behavior, vol.32, issue.2, pp.245-254, 1979.
DOI : 10.1901/jeab.1979.32-245
URL : https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1332899/pdf/jeabehav00086-0111.pdf

J. W. Mink, The basal ganglia: Focused selection and inhibition of competing motor programs, Progress in Neurobiology, vol.50, issue.4, pp.381-425, 1996.

M. Mishkin, B. Malamut, and J. Bachevalier, Memories and habits: Two neural systems, Neurobiology of human learning and memory, 1984.

D. R. Muir and M. Cook, Anatomical constraints on lateral competition in columnar cortical architectures, Neural Computation, vol.26, issue.8, pp.1624-1666, 2014.
DOI : 10.1162/neco_a_00613
URL : http://edoc.unibas.ch/41443/1/20160120101438_569f4ffecfe7a.pdf

M. Naruse, M. Berthel, A. Drezet, S. Huant, M. Aono et al., Single-photon decision maker, Scientific Reports, vol.5, issue.1, 2015.
DOI : 10.1038/srep13253
URL : https://hal.archives-ouvertes.fr/hal-01627820

E. S. Nisenbaum and C. J. Wilson, Potassium currents responsible for inward and outward rectification in rat neostriatal spiny projection neurons, Journal of Neuroscience, vol.15, issue.6, pp.4449-4463, 1995.
DOI : 10.1523/jneurosci.15-06-04449.1995
URL : http://www.jneurosci.org/content/15/6/4449.full.pdf

Y. Niv and A. Langdon, Reinforcement learning with Marr, Current Opinion in Behavioral Sciences, vol.11, pp.67-73, 2016.
DOI : 10.1016/j.cobeha.2016.04.005
URL : http://europepmc.org/articles/pmc4939081?pdf=render

R. C. O'Reilly and M. J. Frank, Making working memory work: A computational model of learning in the prefrontal cortex and basal ganglia, Neural Computation, vol.18, issue.2, pp.283-328, 2006.

D. E. Oorschot, Total number of neurons in the neostriatal, pallidal, subthalamic, and substantia nigral nuclei of the rat basal ganglia: A stereological study using the Cavalieri and optical disector methods, The Journal of Comparative Neurology, vol.366, issue.4, pp.580-599, 1996.

M. G. Packard and B. J. Knowlton, Learning and memory functions of the basal ganglia, Annual Review of Neuroscience, vol.25, issue.1, pp.563-593, 2002.

A. Parent, F. Sato, Y. Wu, J. Gauthier, M. Lévesque et al., Organization of the basal ganglia: The importance of axonal collateralization, Trends in Neurosciences, vol.23, pp.20-27, 2000.

H. Parthasarathy, J. Schall, and A. M. Graybiel, Distributed but convergent ordering of corticostriatal projections: Analysis of the frontal eye field and the supplementary eye field in the macaque monkey, Journal of Neuroscience, vol.12, issue.11, pp.4468-4488, 1992.

B. Pasquereau, A. Nadjar, D. Arkadir, E. Bezard, M. Goillandeau et al., Shaping of motor responses by incentive values through the basal ganglia, Journal of Neuroscience, vol.27, issue.5, pp.1176-1183, 2007.

V. Pawlak and J. N. Kerr, Dopamine receptor activation is required for corticostriatal spike-timing-dependent plasticity, Journal of Neuroscience, vol.28, issue.10, pp.2435-2446, 2008.

C. Piron, D. Kase, M. Topalidou, M. Goillandeau, H. Orignac et al., The globus pallidus pars interna in goal-oriented and routine behaviors: Resolving a long-standing paradox, Movement Disorders, issue.8, pp.1146-1154, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01317968

C. Plowright and S. J. Shettleworth, The role of shifting in choice behavior of pigeons on a two-armed bandit, Behavioural Processes, vol.21, issue.2-3, pp.157-178, 1990.

T. Pohlert, The pairwise multiple comparison of mean ranks package (PMCMR), R package, 2014.

P. Redgrave, K. Gurney, and J. Reynolds, What is reinforced by phasic dopamine signals?, Brain Research Reviews, vol.58, issue.2, pp.322-339, 2008.

C. R. Reid, H. Macdonald, R. P. Mann, J. A. Marshall, T. Latty et al., Decision-making without a brain: How an amoeboid organism solves the two-armed bandit, Journal of The Royal Society Interface, vol.13, issue.119, p.20160030, 2016.

J. N. Reynolds, B. I. Hyland, and J. R. Wickens, A cellular mechanism of reward-related learning, Nature, vol.413, issue.6851, 2001.

H. Robbins, Some aspects of the sequential design of experiments, Bulletin of the American Mathematical Society, vol.58, issue.5, pp.527-536, 1952.

N. P. Rougier and M. Topalidou, Covert learning in the basal ganglia: Code, 2017.

M. I. Sandstrom and G. V. Rebec, Characterization of striatal activity in conscious rats: Contribution of NMDA and AMPA/kainate receptors to both spontaneous and glutamate-driven firing, Synapse, vol.47, issue.2, pp.91-100, 2002.

H. Schroll, J. Vitay, and F. H. Hamker, Dysfunctional and compensatory synaptic plasticity in Parkinson's disease, European Journal of Neuroscience, vol.39, issue.4, pp.688-702, 2013.

C. A. Seger and B. J. Spiering, A critical review of habit learning and the basal ganglia, Frontiers in Systems Neuroscience, vol.5, 2011.

O. Shriki, D. Hansel, and H. Sompolinsky, Rate models for conductance-based cortical neuronal networks, Neural Computation, vol.15, issue.8, pp.1809-1841, 2003.
URL : https://hal.archives-ouvertes.fr/hal-00173803

M. Steyvers, M. D. Lee, and E. Wagenmakers, A Bayesian analysis of human decision-making on bandit problems, Journal of Mathematical Psychology, vol.53, issue.3, pp.168-179, 2009.

R. E. Suri and W. Schultz, A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task, Neuroscience, vol.91, issue.3, pp.871-890, 1999.

R. E. Suri, TD models of reward predictive responses in dopamine neurons, Neural Networks, vol.15, issue.4-6, pp.523-533, 2002.

D. J. Surmeier, J. Ding, M. Day, Z. Wang, and W. Shen, D1 and D2 dopamine-receptor modulation of striatal glutamatergic signaling in striatal medium spiny neurons, Trends in Neurosciences, vol.30, issue.5, pp.228-235, 2007.

R. S. Sutton and A. G. Barto, Reinforcement learning: An introduction, 1998.

M. Takada, H. Tokuno, I. Hamada, M. Inase, Y. Ito et al., Organization of inputs from cingulate motor areas to basal ganglia in macaque monkey, European Journal of Neuroscience, vol.14, issue.10, pp.1633-1650, 2001.

J. G. Taylor, Neural 'bubble' dynamics in two dimensions: Foundations, Biological Cybernetics, vol.80, issue.6, pp.393-409, 1999.

F. Villagrasa, J. Baladron, J. Vitay, H. Schroll, E. G. Antzoulatos et al., On the role of cortex-basal ganglia interactions for category learning: A neurocomputational approach, The Journal of Neuroscience, vol.38, issue.44, pp.9551-9562, 2018.

C. von der Malsburg, Self-organization of orientation sensitive cells in the striate cortex, Kybernetik, vol.14, issue.2, pp.85-100, 1973.

K. Webster, Cortico-striate interrelations in the albino rat, Journal of Anatomy, vol.95, pp.532-544, 1961.

C. J. Wilson, Morphology and synaptic connections of crossed corticostriatal neurons in the rat, The Journal of Comparative Neurology, vol.263, issue.4, pp.567-580, 1987.

C. J. Wilson and P. M. Groves, Spontaneous firing patterns of identified spiny neurons in the rat neostriatum, Brain Research, vol.220, issue.1, pp.67-80, 1981.

H. R. Wilson and J. D. Cowan, Excitatory and inhibitory interactions in localized populations of model neurons, Biophysical Journal, vol.12, issue.1, pp.1-24, 1972.

H. R. Wilson and J. D. Cowan, A mathematical theory of the functional dynamics of cortical and thalamic nervous tissue, Kybernetik, vol.13, issue.2, pp.55-80, 1973.

H. H. Yin and B. J. Knowlton, The role of the basal ganglia in habit formation, Nature Reviews Neuroscience, vol.7, issue.6, pp.464-476, 2006.