Improving the efficiency of online POMDPs by using belief similarity measures, 2013 IEEE International Conference on Robotics and Automation, 2013. ,
Open loop optimistic planning, Proc. of COLT, 2010. ,
URL : https://hal.archives-ouvertes.fr/hal-00943119
Pure Exploration in Multi-armed Bandits Problems, Lecture Notes in Computer Science, pp.23-37, 2009. ,
Optimistic planning for markov decision processes, Artificial Intelligence and Statistics, pp.182-189, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00756736
A Dynamic Programming Approach to Viability Problems, 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning, 2007. ,
URL : https://hal.archives-ouvertes.fr/inria-00125423
Efficient Selectivity and Backup Operators in Monte-Carlo Tree Search, Computers and Games, pp.72-83, 2007. ,
URL : https://hal.archives-ouvertes.fr/inria-00116992
Simple Regret Optimization in Online Planning for Markov Decision Processes, Journal of Artificial Intelligence Research, vol.51, pp.165-205, 2014. ,
Optimism in reinforcement learning and Kullback-Leibler divergence, 2010 48th Annual Allerton Conference on Communication, Control, and Computing (Allerton), 2010. ,
URL : https://hal.archives-ouvertes.fr/hal-00476116