
Solving Hidden-Semi-Markov-Mode Markov Decision Problems

Emmanuel Hadoux 1 Aurélie Beynier 1 Paul Weng 2 
1 SMA - Systèmes Multi-Agents
LIP6 - Laboratoire d'Informatique de Paris 6
Abstract : Hidden-Mode Markov Decision Processes (HM-MDPs) were proposed to represent sequential decision-making problems in non-stationary environments that evolve according to a Markov chain. In this paper, we introduce Hidden-Semi-Markov-Mode Markov Decision Processes (HS3MDPs), a generalization of HM-MDPs to the more realistic case of non-stationary environments that evolve according to a semi-Markov chain. Like HM-MDPs, HS3MDPs form a subclass of Partially Observable Markov Decision Processes. Large instances of HS3MDPs (and HM-MDPs) can therefore be solved with an online algorithm, Partially Observable Monte Carlo Planning (POMCP), which is based on Monte Carlo Tree Search and uses particle filters to approximate belief states. We propose a first adaptation of POMCP that solves HS3MDPs more efficiently by exploiting their structure. Our empirical results show that this first adaptation reaches higher cumulative rewards than the original algorithm. However, in larger instances, POMCP may run out of particles. To solve this issue, we propose a second adaptation of POMCP that replaces particle filters with exact representations of beliefs. Our empirical results indicate that this second version reaches high cumulative rewards faster than the first adaptation and remains efficient even for large problems.
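To illustrate what an exact belief representation means here, the sketch below performs a Bayes update of a belief over hidden modes after one observed state transition. It is a simplified illustration for the Markov-mode (HM-MDP) case only, not the authors' algorithm: it ignores the mode durations of the semi-Markov case, and it assumes the mode changes before the state transition is drawn. The function name `update_mode_belief` and the data layout are hypothetical.

```python
def update_mode_belief(belief, mode_transition, state_transition, s, a, s_next):
    """Exact Bayes update of a belief over hidden modes (illustrative sketch).

    belief[m]               -- prior probability of being in mode m
    mode_transition[m][m2]  -- probability that mode m switches to mode m2
    state_transition[m2][(s, a)][s_next]
                            -- probability of reaching s_next from s with
                               action a when the environment is in mode m2
    Assumption: the mode changes first, then the state transition is drawn
    from the new mode's dynamics.
    """
    n_modes = len(belief)
    unnormalized = []
    for m2 in range(n_modes):
        # Prediction step: probability of being in mode m2 after the switch.
        predicted = sum(belief[m] * mode_transition[m][m2] for m in range(n_modes))
        # Correction step: how well mode m2 explains the observed transition.
        likelihood = state_transition[m2][(s, a)].get(s_next, 0.0)
        unnormalized.append(predicted * likelihood)

    total = sum(unnormalized)
    if total == 0.0:
        raise ValueError("observed transition has zero probability under every mode")
    return [p / total for p in unnormalized]


# Toy instance with two modes that disagree about where action "a" leads.
toy_mode_transition = [[0.9, 0.1], [0.1, 0.9]]
toy_state_transition = [
    {(0, "a"): {0: 0.8, 1: 0.2}},  # mode 0 mostly keeps the agent in state 0
    {(0, "a"): {0: 0.2, 1: 0.8}},  # mode 1 mostly moves the agent to state 1
]
posterior = update_mode_belief(
    [0.5, 0.5], toy_mode_transition, toy_state_transition, s=0, a="a", s_next=1
)
```

Unlike a particle filter, this update always returns a full distribution as long as some mode can explain the observed transition, so it cannot run out of particles. In the toy instance, observing a move to state 1 shifts the belief toward mode 1.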
Document type :
Conference papers

Cited literature [16 references]
Contributor : Emmanuel Hadoux
Submitted on : Thursday, September 17, 2015 - 11:42:02 AM
Last modification on : Sunday, June 26, 2022 - 9:48:21 AM
Long-term archiving on : Tuesday, December 29, 2015 - 7:43:15 AM





Emmanuel Hadoux, Aurélie Beynier, Paul Weng. Solving Hidden-Semi-Markov-Mode Markov Decision Problems. Scalable Uncertainty Management, Sep 2014, Oxford, United Kingdom. pp.176-189, ⟨10.1007/978-3-319-11508-5_15⟩. ⟨hal-01200812⟩


