Multidimensional Triangulation and Interpolation for Reinforcement Learning, Advances in Neural Information Processing Systems, 1997. ,
Variable Resolution Discretization in Optimal Control, 1999. ,
Variable Resolution Discretization for High-accuracy Solutions of Optimal Control Problems, IJCAI, pp.1348-1355, 1999. ,
A New Approach to Manipulator Control: The Cerebellar Model Articulation Controller (CMAC), Journal of Dynamic Systems, Measurement, and Control, vol.97, issue.3, pp.220-227, 1975. ,
DOI : 10.1115/1.3426922
Using Cerebellar Arithmetic Computers, In: AI Expert, vol.7, 1992. ,
Q-Learning in Continuous State and Action Spaces, Australian Joint Conference on Artificial Intelligence, pp.417-428, 1999. ,
DOI : 10.1007/3-540-46695-9_35
Vector Quantization and Signal Compression, 1991. ,
DOI : 10.1007/978-1-4615-3626-0
Reinforcement Learning for Robocupsoccer Keepaway, Adaptive Behavior, vol.3, pp.165-188, 2005. ,
Two steps reinforcement learning, International Journal of Intelligent Systems, vol.43, issue.2, pp.213-245, 2008. ,
DOI : 10.1002/int.20255
Multiresolution state-space discretization method for Q-Learning, 2009 American Control Conference, 2009. ,
DOI : 10.1109/ACC.2009.5160474
Reinforcement Learning: An Introduction, IEEE Transactions on Neural Networks, vol.9, issue.5, 1998. ,
DOI : 10.1109/TNN.1998.712192
Learning from Delayed Rewards, 1989. ,
Monte-Carlo Tree Search in Crazy Stone, In: Game Programming Workshop, 2007. ,
URL : https://hal.archives-ouvertes.fr/inria-00177155
Boosting Active Learning to Optimality: A Tractable Monte-Carlo, Billiard-Based Algorithm, European Conference on Machine Learning, 2009. ,
DOI : 10.1007/978-3-642-04174-7_20
URL : https://hal.archives-ouvertes.fr/inria-00433866
Algorithms for Infinitely Many-armed Bandits, Advances in Neural Information Processing Systems, 2008. ,
Efficient Selectivity and Backup Operators in Monte-Carlo Tree Search, Proceedings of the 5th International Conference on Computers and Games, 2006. ,
DOI : 10.1007/978-3-540-75538-8_7
URL : https://hal.archives-ouvertes.fr/inria-00116992