Deciding the Value 1 Problem for #-acyclic Partially Observable Markov Decision Processes - Archive ouverte HAL Accéder directement au contenu
Rapport Année : 2012

Deciding the Value 1 Problem for #-acyclic Partially Observable Markov Decision Processes

Résumé

The value 1 problem is a natural decision problem in algorithmic game theory. For partially observable Markov decision processes with reachability objective, this problem is defined as follows: are there strategies that achieve the reachability objective with probability arbitrarily close to 1? This problem was shown undecidable recently. Our contribution is to introduce a class of partially observable Markov decision processes, namely #-acyclic partially observable Markov decision processes, for which the value 1 problem is decidable. Our algorithm is based on the construction of a two-player perfect information game, called the knowledge game, abstracting the behaviour of a #-acyclic partially observable Markov decision process M such that the first player has a winning strategy in the knowledge game if and only if the value of M is 1.
Fichier principal
Vignette du fichier
main.pdf (396.31 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00743137 , version 1 (18-10-2012)
hal-00743137 , version 2 (19-04-2013)
hal-00743137 , version 3 (17-10-2013)

Identifiants

  • HAL Id : hal-00743137 , version 3

Citer

Hugo Gimbert, Youssouf Oualhadj. Deciding the Value 1 Problem for #-acyclic Partially Observable Markov Decision Processes. 2012. ⟨hal-00743137v3⟩
254 Consultations
132 Téléchargements

Partager

Gmail Facebook X LinkedIn More