The max-plus finite element method for solving deterministic optimal control problems: basic properties and convergence analysis, SIAM Journal on Control and Optimization, vol.47, issue.2, pp.817-848, 2008. ,
URL : https://hal.archives-ouvertes.fr/inria-00071395
Max-plus matching pursuit for deterministic Markov decision processes, working paper or preprint, 2019. ,
, OpenAI gym, 2016.
Adaptive aggregation for reinforcement learning with efficient exploration: Deterministic domains, pp.323-334, 2008. ,
Duality and separation theorems in idempotent semimodules, Linear Algebra and its Applications, vol.379, pp.395-422, 2004. ,
URL : https://hal.archives-ouvertes.fr/inria-00071917
Quad trees: A data structure for retrieval on composite keys, Acta Inf, vol.4, pp.1-9, 1974. ,
, Controlled Markov Processes and Viscosity Solutions, 2006.
Curse of dimensionality reduction in max-plus based approximation methods: Theoretical estimates and improved pruning algorithms, 50th IEEE Conference on Decision and Control, pp.1054-1061, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00935266
Methods and applications of (max,+) linear algebra, Annual Symposium on Theoretical Aspects of Computer Science, pp.261-282, 1997. ,
URL : https://hal.archives-ouvertes.fr/inria-00073603
, Discrete-time Markov Control Processes: Basic Optimality Criteria, vol.30, 2012.
Calculus of Variations and Optimal Control Theory: A Concise Introduction, 2011. ,
Max-plus eigenvector representations for solution of nonlinear H infinity problems: basic concepts, IEEE Transactions on Automatic Control, vol.48, issue.7, pp.1150-1163, 2003. ,
Variable resolution discretization in optimal control, Machine Learning, vol.49, issue.2-3, pp.291-323, 2002. ,
Q-learning and Pontryagin's minimum principle, Proceedings of the 48h IEEE Conference on Decision and Control, pp.3598-3605, 2009. ,
Matching pursuits with time-frequency dictionaries, IEEE Transactions on Signal Processing, vol.41, issue.12, pp.3397-3415, 1993. ,
Reinforcement learning: An introduction, 2018. ,
Making deep Q-learning methods robust to time discretization, International Conference on Machine Learning, pp.6096-6104, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02435523