D. Rohde, S. Bonner, T. Dunlop, F. Vasile, and A. Karatzoglou, RecoGym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising, 2018.

G. Bresler and M. Karzand, Regret bounds and regimes of optimality for user-user and item-item collaborative filtering, 2018 Information Theory and Applications Workshop, 2018.

P. Kohli, M. Salek, and G. Stoddard, A fast bandit algorithm for recommendations to users with heterogeneous tastes, Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, ser. AAAI'13, pp.1135-1141, 2013.

J. Mary, R. Gaudel, and P. Preux, Bandits and recommender systems, Revised Selected Papers of the First International Workshop on Machine Learning, vol.9432, pp.325-336, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01256033

M. Tokic, Adaptive -greedy exploration in reinforcement learning based on value differences, Proceedings of the 33rd Annual German Conference on Advances in Artificial Intelligence, ser. KI'10, pp.203-210, 2010.

P. Auer, N. Cesa-bianchi, and P. Fischer, Finite-time analysis of the multiarmed bandit problem, Machine Learning, 2002.

P. Auer, N. Cesa-bianchi, Y. Freund, and R. E. Schapire, The Nonstochastic Multiarmed Bandit Problem, SIAM Journal on Computing, 2003.

A. Garivier and E. Moulines, On Upper-Confidence Bound Policies for Non-Stationary Bandit Problems, 2008.
URL : https://hal.archives-ouvertes.fr/hal-00281392

G. Brockman, V. Cheung, L. Pettersson, J. Schneider, J. Schulman et al., Openai gym, 2016.

C. Hartland, S. Gelly, N. Baskiotis, and O. Teytaud, Multi-armed bandit, dynamic environments and meta-bandits, Environments, pp.1-14, 2006.
URL : https://hal.archives-ouvertes.fr/hal-00113668

E. Bigdeli and Z. Bahmani, Comparing accuracy of cosine-based similarity and correlation-based similarity algorithms in tourism recommender systems, 4th IEEE International Conference on Management of Innovation and Technology, pp.469-474, 2008.

J. Leskovec, A. Rajaraman, and J. D. Ullman, Mining of massive datasets, 2014.

C. C. Aggarwal, Recommender systems, 2016.