M. Akian, S. Gaubert, and A. Lakhoua, The max-plus finite element method for solving deterministic optimal control problems: basic properties and convergence analysis, SIAM Journal on Control and Optimization, vol.47, issue.2, pp.817-848, 2008.
URL : https://hal.archives-ouvertes.fr/inria-00071395

F. Bach, Max-plus matching pursuit for deterministic Markov decision processes, working paper or preprint, 2019.

V. +-16]-greg-brockman, L. Cheung, J. Pettersson, J. Schneider, J. Schulman et al., OpenAI gym, 2016.

A. Bernstein and N. Shimkin, Adaptive aggregation for reinforcement learning with efficient exploration: Deterministic domains, pp.323-334, 2008.

G. Cohen, S. Gaubert, and J. Quadrat, Duality and separation theorems in idempotent semimodules, Linear Algebra and its Applications, vol.379, pp.395-422, 2004.
URL : https://hal.archives-ouvertes.fr/inria-00071917

R. Finkel and J. Bentley, Quad trees: A data structure for retrieval on composite keys, Acta Inf, vol.4, pp.1-9, 1974.

H. Wendell, H. Fleming, and . Soner, Controlled Markov Processes and Viscosity Solutions, 2006.

S. Gaubert, W. Mceneaney, and Z. Qu, Curse of dimensionality reduction in max-plus based approximation methods: Theoretical estimates and improved pruning algorithms, 50th IEEE Conference on Decision and Control, pp.1054-1061, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00935266

S. Gaubert and M. Plus, Methods and applications of (max,+) linear algebra, Annual Symposium on Theoretical Aspects of Computer Science, pp.261-282, 1997.
URL : https://hal.archives-ouvertes.fr/inria-00073603

O. Hernández, -. Lerma, and J. B. Lasserre, Discrete-time Markov Control Processes: Basic Optimality Criteria, vol.30, 2012.

D. Liberzon, Calculus of Variations and Optimal Control Theory: A Concise Introduction, 2011.

M. William and . Mceneaney, Max-plus eigenvector representations for solution of nonlinear H infinity problems: basic concepts, IEEE Transactions on Automatic Control, vol.48, issue.7, pp.1150-1163, 2003.

R. Munos and A. Moore, Variable resolution discretization in optimal control, Machine Learning, vol.49, issue.2-3, pp.291-323, 2002.

P. Mehta and S. Meyn, Q-learning and Pontryagin's minimum principle, Proceedings of the 48h IEEE Conference on Decision and Control, pp.3598-3605, 2009.

G. Stéphane, Z. Mallat, and . Zhang, Matching pursuits with time-frequency dictionaries, IEEE Transactions on Signal Processing, vol.41, issue.12, pp.3397-3415, 1993.

R. S. Sutton and A. G. Barto, Reinforcement learning: An introduction, 2018.

C. Tallec, L. Blier, and Y. Ollivier, Making deep Q-learning methods robust to time discretization, International Conference on Machine Learning, pp.6096-6104, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02435523