N. Carrara, R. Laroche, J. Bouraoui, T. Urvoy, and O. Pietquin, A Fitted-Q Algorithm for Budgeted MDPs. Workshop on Safety, Risk and Uncertainty in Reinforcement Learning (UAI2018), 2018.
URL : https://hal.archives-ouvertes.fr/hal-01867353

N. Carrara, R. Laroche, and O. Pietquin, Online learning and transfer for user adaptation in dialogue systems, SIGDIAL/SEMDIAL on negotiation dialog, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01557775

N. Casanueva, T. Hain, H. Christensen, R. Marxer, and P. Green, Knowledge transfer between speakers for personalised dialogue management, 2015.
DOI : 10.18653/v1/w15-4603

URL : https://doi.org/10.18653/v1/w15-4603

S. Chandramohan, M. Geist, and O. Pietquin, Optimizing Spoken Dialogue Management from Data Corpora with Fitted Value Iteration, Proceedings of the International Conference on Speech Communication and Technologies, 2010.

A. Genevay and R. Laroche, Transfer learning for user adaptation in spoken dialogue systems, 2016.

J. C. Watkins, C. Dayan, and P. , Q-learning, Machine Learning, pp.279-292, 1992.

A. Lazaric, Transfer in Reinforcement Learning: a Framework and a Survey, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00772626

R. S. Sutton and A. G. Barto, Reinforcement learning: An introduction, 1998.

M. Taylor and P. Stone, Transfer Learning for Reinforcement Learning Domains : A Survey, JMLR, 2009.