Conference paper, Year: 2014

Deciding the Value 1 Problem for sharp-acyclic Partially Observable Markov Decision Processes

Abstract

The value 1 problem is a natural decision problem in algorithmic game theory. For partially observable Markov decision processes with a reachability objective, the problem is defined as follows: are there observational strategies that achieve the reachability objective with probability arbitrarily close to 1? This problem was recently shown to be undecidable. Our contribution is to introduce a class of partially observable Markov decision processes, namely ♯-acyclic partially observable Markov decision processes, for which the value 1 problem is decidable. Our algorithm is based on the construction of a two-player perfect-information game, called the knowledge game, which abstracts the behaviour of a ♯-acyclic partially observable Markov decision process M in such a way that the first player has a winning strategy in the knowledge game if and only if the value of M is 1.
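To make "probability arbitrarily close to 1" concrete, here is a minimal sketch on a toy model. The model is an assumption made for illustration and does not come from the paper: a blind process (a single observation, so a strategy is just a fixed word of actions) with hypothetical states `s`, `t`, `goal` and `sink`. Playing `a` n times and then `b` reaches `goal` with probability 1 - 2^-n, so the value is 1 even though no single strategy reaches `goal` with probability exactly 1. The sketch does not implement the knowledge game or the ♯-acyclicity test.

```python
from fractions import Fraction

ONE = Fraction(1)
HALF = Fraction(1, 2)

# Hypothetical toy model (not from the paper).
# TRANSITIONS[action][state] maps each successor state to its probability.
TRANSITIONS = {
    "a": {
        "s": {"s": HALF, "t": HALF},  # coin flip: stay in s or move to t
        "t": {"t": ONE},
        "goal": {"goal": ONE},        # absorbing
        "sink": {"sink": ONE},        # absorbing
    },
    "b": {
        "s": {"sink": ONE},           # committing too early loses
        "t": {"goal": ONE},           # committing from t wins
        "goal": {"goal": ONE},
        "sink": {"sink": ONE},
    },
}


def step(distribution, action):
    """Push a distribution over states through one action."""
    successor = {}
    for state, p in distribution.items():
        for target, q in TRANSITIONS[action][state].items():
            successor[target] = successor.get(target, Fraction(0)) + p * q
    return successor


def reach_probability(word):
    """Probability of being in 'goal' after playing the word from state 's'.

    Because 'goal' is absorbing, this equals the probability of reaching it.
    """
    distribution = {"s": ONE}
    for action in word:
        distribution = step(distribution, action)
    return distribution.get("goal", Fraction(0))


if __name__ == "__main__":
    for n in range(1, 6):
        print(n, reach_probability("a" * n + "b"))
```

In this toy model the success probabilities increase towards 1 as n grows, yet every word that ever plays `b` fails with positive probability and a word that never plays `b` never reaches `goal`. This gap between a supremum of 1 and an attained probability of 1 is what the value 1 problem asks about.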
Main file
pomdp.pdf (162.2 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01006394, version 1 (16-06-2014)

Identifiers

HAL Id: hal-01006394
DOI: 10.1007/978-3-319-04298-5_25

Cite

Hugo Gimbert, Youssouf Oualhadj. Deciding the Value 1 Problem for sharp-acyclic Partially Observable Markov Decision Processes. SOFSEM 2014, Jan 2014, Nový Smokovec, Slovakia. pp.281-292, ⟨10.1007/978-3-319-04298-5_25⟩. ⟨hal-01006394⟩

Collections

CNRS