M. González, O. Beaude, P. Bouyer, S. Lasaulce, and N. Markey, Stratégies d'ordonnancement de consommation d'énergie en présence d'information imparfaite de prévision, 2017.

P. Bouyer, M. González, N. Markey, and M. Randour, Multiweighted Markov decision processes with reachability objectives, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01889020

F. J. Beutler and K. W. Ross, Optimal policies for controlled Markov chains with a constraint, vol.112, pp.236-252, 1985.

D. Bertsekas, Convex optimization theory. Belmont : Athena Scientific, 2009.

C. Baier, N. Bertrand, C. Dubslaff, D. Gburek, and O. Sankur, Stochastic shortest paths and weight-bounded properties in Markov decision processes, LICS'18
URL : https://hal.archives-ouvertes.fr/hal-01883409

R. E. Bellman and E. D. Stuart, Applied dynamic programming, vol.2050, 2015.