S. Davies, Multidimensional Triangulation and Interpolation for Reinforcement Learning, Advances in Neural Information Processing Systems, 1997.

R. Munos and A. Moore, Variable Resolution Discretization in Optimal Control, 1999.

R. Munos and A. W. Moore, Variable Resolution Discretization for High-accuracy Solutions of Optimal Control Problems, IJCAI, pp.1348-1355, 1999.

J. S. Albus, A New Approach to Manipulator Control: The Cerebellar Model Articulation Controller (CMAC), Journal of Dynamic Systems, Measurement, and Control, vol.97, issue.3, pp.220-227, 1975.
DOI : 10.1115/1.3426922

G. Burgin, Using Cerebellar Arithmetic Computers, In: AI Expert, vol.7, 1992.

C. Gaskett, D. Wettergreen, and A. Zelinsky, Q-Learning in Continuous State and Action Spaces, Australian Joint Conference on Artificial Intelligence, pp.417-428, 1999.
DOI : 10.1007/3-540-46695-9_35

A. Gersho and R. M. Gray, Vector Quantization and Signal Compression, 1991.
DOI : 10.1007/978-1-4615-3626-0

P. Stone, R. S. Sutton, and G. Kuhlmann, Reinforcement Learning for Robocupsoccer Keepaway, Adaptive Behavior, vol.3, pp.165-188, 2005.

F. Fernández and D. Borrajo, Two steps reinforcement learning, International Journal of Intelligent Systems, vol.43, issue.2, pp.213-245, 2008.
DOI : 10.1002/int.20255

A. Lampton and J. Valasek, Multiresolution state-space discretization method for Q-Learning, 2009 American Control Conference, 2009.
DOI : 10.1109/ACC.2009.5160474

R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, IEEE Transactions on Neural Networks, vol.9, issue.5, 1998.
DOI : 10.1109/TNN.1998.712192

C. J. Watkings, Learning from Delayed Rewards, 1989.

R. Coulom, Monte-Carlo Tree Search in Crazy Stone, In: Game Programming Workshop, 2007.
URL : https://hal.archives-ouvertes.fr/inria-00177155

P. Rolet, M. Sebag, and O. Teytaud, Boosting Active Learning to Optimality: A Tractable Monte-Carlo, Billiard-Based Algorithm, European Conference on Machine Learning, 2009.
DOI : 10.1007/978-3-642-04174-7_20
URL : https://hal.archives-ouvertes.fr/inria-00433866

Y. Wang, J. Y. Audibert, and R. Munos, Algorithms for Infinitely Many-armed Bandits, Advances in Neural Information Processing Systems, 2008.

R. Coulom, Efficient Selectivity and Backup Operators in Monte-Carlo Tree Search, Proceedings of the 5th International Conference on Computers and Games, 2006.
DOI : 10.1007/978-3-540-75538-8_7
URL : https://hal.archives-ouvertes.fr/inria-00116992