A cautious approach to generalization in reinforcement learning, Proceedings of the International Conference on Agents and Artificial Intelligence, pp.64-73, 2010.
Towards Min Max Generalization in Reinforcement Learning, Agents and Artificial Intelligence: International Conference, ICAART 2010, pp.61-77, 2011.
DOI : 10.1109/TIT.1967.1054010
Bandit Based Monte-Carlo Planning, ECML-06. Number 4212 in LNCS, pp.282-293, 2006.
DOI : 10.1007/11871842_29
Asymptotically efficient adaptive allocation rules, Advances in Applied Mathematics, vol.6, issue.1, pp.4-22, 1985.
DOI : 10.1016/0196-8858(85)90002-8
Finite-time analysis of the multiarmed bandit problem, Machine Learning, vol.47, issue.2/3, pp.235-256, 2002.
DOI : 10.1023/A:1013689704352
Modification of UCT with patterns in Monte-Carlo Go, 2006.
URL : https://hal.archives-ouvertes.fr/inria-00117266
A training algorithm for optimal margin classifiers, Proceedings of the fifth annual workshop on Computational learning theory, COLT '92, pp.144-152, 1992.
DOI : 10.1145/130385.130401
Support-vector networks, Machine Learning, pp.273-297, 1995.
DOI : 10.1007/BF00994018
Generalization in reinforcement learning: Successful examples using sparse coarse coding, Advances in Neural Information Processing Systems, pp.1038-1044, 1996.
Estimating the Support of a High-Dimensional Distribution, Neural Computation, vol.13, issue.7, pp.1443-1471, 2001.
DOI : 10.1214/aos/1069362732
LIBSVM, ACM Transactions on Intelligent Systems and Technology, vol.2, issue.3, pp.1-27, 2011.
DOI : 10.1145/1961189.1961199
Combining online and offline knowledge in UCT, Proceedings of the 24th international conference on Machine learning, ICML '07, pp.273-280, 2007.
DOI : 10.1145/1273496.1273531
URL : https://hal.archives-ouvertes.fr/inria-00164003
Monte-Carlo Tree Search in Settlers of Catan, Advances in Computer Games. 12th International Conference, ACG2009, pp.21-32, 2010.
DOI : 10.1007/978-3-642-12993-3_3