Deciding the Value 1 Problem for #-acyclic Partially Observable Markov Decision Processes

Hugo Gimbert; Youssouf Oualhadj

Rapport Année : 2012

Deciding the Value 1 Problem for #-acyclic Partially Observable Markov Decision Processes

(1) , (2, 3)

1
2
3

Hugo Gimbert

Fonction : Auteur
PersonId : 6953
IdHAL : hugo-gimbert
ORCID : 0000-0003-1227-9718
IdRef : 113151918

Laboratoire Bordelais de Recherche en Informatique

Youssouf Oualhadj

Fonction : Auteur
PersonId : 890485

Laboratoire d'informatique Fondamentale de Marseille

Université de Mons

Résumé

The value 1 problem is a natural decision problem in algorithmic game theory. For partially observable Markov decision processes with reachability objective, this problem is defined as follows: are there strategies that achieve the reachability objective with probability arbitrarily close to 1? This problem was shown undecidable recently. Our contribution is to introduce a class of partially observable Markov decision processes, namely #-acyclic partially observable Markov decision processes, for which the value 1 problem is decidable. Our algorithm is based on the construction of a two-player perfect information game, called the knowledge game, abstracting the behaviour of a #-acyclic partially observable Markov decision process M such that the first player has a winning strategy in the knowledge game if and only if the value of M is 1.

Domaines

Informatique et théorie des jeux [cs.GT]

Fichier principal

main.pdf (396.31 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Youssouf Oualhadj : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00743137

Soumis le : jeudi 17 octobre 2013-16:29:11

Dernière modification le : vendredi 24 mars 2023-14:52:57

Archivage à long terme le : vendredi 7 avril 2017-12:53:35

Dates et versions

hal-00743137 , version 1 (18-10-2012)

hal-00743137 , version 2 (19-04-2013)

hal-00743137 , version 3 (17-10-2013)

Identifiants

HAL Id : hal-00743137 , version 3

Citer

Hugo Gimbert, Youssouf Oualhadj. Deciding the Value 1 Problem for #-acyclic Partially Observable Markov Decision Processes. 2012. ⟨hal-00743137v3⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

LIF CNRS UNIV-AMU EC-MARSEILLE LARA LIS-LAB

254 Consultations

132 Téléchargements

Deciding the Value 1 Problem for #-acyclic Partially Observable Markov Decision Processes

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager