A. M. Ahrens, B. F. Singer, C. J. Fitzpatrick, J. D. Morrow, and T. E. Robinson, Rats that sign-track are resistant to Pavlovian but not instrumental extinction, Behav Brain Res, vol.296, pp.418-430, 2016.

P. Anselme, Incentive salience attribution under reward uncertainty: a Pavlovian model, Behav Process, vol.111, pp.6-18, 2015.

K. C. Berridge, From prediction error to incentive salience: mesolimbic computation of reward motivation, Eur J Neurosci, vol.35, issue.7, pp.1124-1143, 2012.

F. Cinotti, V. Fresno, N. Aklil, E. Coutureau, B. Girard et al., Dopamine blockade impairs the exploration-exploitation trade-off in rats, Sci Rep, vol.9, issue.1, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02121649

G. Davey, D. Oakley, and G. G. Cleland, Autoshaping in the rat: effect of omission on the form of the response, J Exp Anal Behav, vol.36, issue.1, pp.75-91, 1981.

N. D. Daw, Y. Niv, and P. Dayan, Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control, Nat Neurosci, vol.8, issue.12, pp.1704-1715, 2005.

N. D. Daw, J. P. O'Doherty, P. Dayan, R. J. Dolan, and B. Seymour, Cortical substrates for exploratory decisions in humans, Nature, vol.441, issue.7095, pp.876-885, 2006.

P. Dayan and K. C. Berridge, Model-based and model-free Pavlovian reward learning: Revaluation, revision, and revelation, Cogn Affect Behav Neurosci, vol.14, issue.2, pp.473-492, 2014.

R. C. Derman, K. Schneider, S. Juarez, and A. R. Delamater, Sign-tracking is an expectancy-mediated behavior that relies on prediction error mechanisms, Learn Mem, vol.25, issue.10, pp.550-563, 2018.

L. Dollé, R. Chavarriaga, A. Guillot, and M. Khamassi, Interactions of spatial strategies producing generalization gradient and blocking: a computational approach, PLoS Comput Biol, vol.14, 2018.

S. B. Flagel, H. Akil, and T. E. Robinson, Individual differences in the attribution of incentive salience to reward-related cues: implications for addiction, Neuropharmacology, vol.56, pp.139-148, 2009.

S. B. Flagel, J. J. Clark, T. E. Robinson, L. Mayo, A. Czuj et al., A selective role for dopamine in stimulus-reward learning, Nature, vol.469, issue.7328, pp.53-57, 2011.

K. M. Fraser and P. H. Janak, Long-lasting contribution of dopamine in the nucleus accumbens core, but not dorsal lateral striatum, to sign-tracking, Eur J Neurosci, vol.46, issue.4, pp.2047-2055, 2017.

P. C. Holland, Conditioned stimulus as a determinant of the form of the Pavlovian conditioned response, J Exp Psychol Anim Behav Process, vol.3, issue.1, pp.77-104, 1977.

P. C. Holland, CS-US interval as a determinant of the form of Pavlovian appetitive conditioned responses, J Exp Psychol Anim Behav Process, vol.6, issue.2, pp.155-174, 1980.

M. Ito and K. Doya, Validation of decision-making models and analysis of decision variables in the rat basal ganglia, J Neurosci, vol.29, issue.31, pp.9861-9874, 2009.

H. M. Jenkins and B. R. Moore, The form of the auto-shaped response with food or water reinforcers, J Exp Anal Behav, vol.20, issue.2, pp.163-181, 1973.

S. Kaveri and H. Nakahara, Dual reward prediction components yield Pavlovian sign- and goal-tracking, PLoS One, vol.9, issue.10, 2014.

M. Keramati, A. Dezfouli, and P. Piray, Speed/accuracy tradeoff between the habitual and the goal-directed processes, PLoS Comput Biol, vol.7, issue.5, 2011.

M. Khamassi, P. Enel, P. F. Dominey, and E. Procyk, Medial prefrontal cortex and the adaptive regulation of reinforcement learning parameters, Prog Brain Res, vol.202, pp.441-464, 2013.
URL : https://hal.archives-ouvertes.fr/hal-01628829

B. Lee, R. N. Gentry, G. B. Bissonette, R. J. Herman, J. J. Mallon et al., Manipulating the revision of reward value during the intertrial interval increases sign tracking and dopamine release, PLoS Biol, vol.16, issue.9, p.2004015, 2018.

F. Lesaint, O. Sigaud, J. J. Clark, S. B. Flagel, and M. Khamassi, Experimental predictions drawn from a computational model of sign-trackers and goal-trackers, J Physiol Paris, vol.109, issue.1-3, pp.78-86, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01219979

F. Lesaint, O. Sigaud, S. B. Flagel, T. E. Robinson, and M. Khamassi, Modelling individual differences in the form of Pavlovian conditioned approach responses: a dual learning systems approach with factored representations, PLoS Comput Biol, vol.10, issue.2, 2014.

F. Lesaint, O. Sigaud, and M. Khamassi, Accounting for negative automaintenance in pigeons: A dual learning systems approach and factored representations, PLoS One, vol.9, issue.10, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01219998

P. J. Meyer, V. Lovic, B. T. Saunders, L. M. Yager, S. B. Flagel et al., Quantifying individual variation in the propensity to attribute incentive salience to reward cues, PLoS One, vol.7, issue.6, 2012.

S. E. Morrison, M. A. Bamkole, and S. M. Nicola, Sign tracking, but not goal tracking, is resistant to outcome devaluation, Front Neurosci, vol.9, pp.1-12, 2015.

H. M. Nasser, Y. Chen, K. Fiscella, and D. J. Calu, Individual variability in behavioral flexibility predicts sign-tracking tendency, Front Behav Neurosci, vol.9, pp.1-18, 2015.

E. Patitucci, A. J. Nelson, D. M. Dwyer, and R. C. Honey, The origins of individual differences in how learning is expressed in rats: a general-process perspective, J Exp Psychol Anim Learn Cogn, vol.42, issue.4, pp.313-324, 2016.

R. A. Rescorla and A. R. Wagner, A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement, Classical Conditioning II: Current Research and Theory, pp.64-99, 1972.

T. E. Robinson and S. B. Flagel, Dissociating the predictive and incentive motivational properties of reward-related cues through the study of individual differences, Biol Psychiatry, vol.65, issue.10, pp.869-873, 2009.

B. T. Saunders and T. E. Robinson, The role of dopamine in the accumbens core in the expression of Pavlovian-conditioned responses, Eur J Neurosci, vol.36, issue.4, pp.2521-2553, 2012.

B. T. Saunders and T. E. Robinson, Individual variation in resisting temptation: implications for addiction, Neurosci Biobehav Rev, vol.37, issue.9, pp.1955-1975, 2013.

S. A. Scülfort, D. Bartsch, and T. Enkel, Dopamine antagonism does not impair learning of Pavlovian conditioned approach to manipulable or non-manipulable cues but biases responding towards goal tracking, Behav Brain Res, vol.314, pp.1-5, 2016.

D. A. Simon, N. Daw, G. Viejo, M. Khamassi, A. Brovelli et al., Modeling choice and reaction time during arbitrary visuomotor learning through the coordination of adaptive working memory and reinforcement learning, Frontiers in Behavioral Neuroscience, vol.9, 2012.

R. C. Wilson, A. Geana, J. M. White, E. A. Ludvig, and J. D. Cohen, Humans use directed and random exploration to solve the explore-exploit dilemma, J Exp Psychol Gen, vol.143, issue.6, pp.2074-2081, 2014.

J. Zhang, K. C. Berridge, A. J. Tindell, K. S. Smith, and J. W. Aldridge, A neural computational model of incentive salience, PLoS Comput Biol, vol.5, issue.7, p.1000437, 2009.
