Deciding the Value 1 Problem for sharp-acyclic Partially Observable Markov Decision Processes

Hugo Gimbert; Youssouf Oualhadj

doi:10.1007/978-3-319-04298-5_25

Communication Dans Un Congrès Année : 2014

Deciding the Value 1 Problem for sharp-acyclic Partially Observable Markov Decision Processes

(1) , (1, 2)

1
2

Hugo Gimbert

Fonction : Auteur
PersonId : 6953
IdHAL : hugo-gimbert
ORCID : 0000-0003-1227-9718
IdRef : 113151918

Laboratoire Bordelais de Recherche en Informatique

Youssouf Oualhadj

Fonction : Auteur

Laboratoire Bordelais de Recherche en Informatique

Institut de Mathématiques [Mons]

Résumé

The value 1 problem is a natural decision problem in algorithmic game theory. For partially observable Markov decision processes with reachability objective, this problem is defined as follows: are there observational strategies that achieve the reachability objective with probability arbitrarily close to 1? This problem was shown undecidable recently. Our contribution is to introduce a class of partially observable Markov decision processes, namely ]-acyclic partially observable Markov decision processes, for which the value 1 problem is decidable. Our algorithm is based on the construction of a two-player perfect information game, called the knowledge game, abstracting the behaviour of a ]-acyclic partially observable Markov decision process M such that the first player has a winning strategy in the knowledge game if and only if the value of M is 1.

Domaines

Informatique et théorie des jeux [cs.GT]

Fichier principal

pomdp.pdf (162.2 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Hugo Gimbert : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01006394

Soumis le : lundi 16 juin 2014-08:38:09

Dernière modification le : vendredi 24 mars 2023-14:52:58

Archivage à long terme le : mardi 16 septembre 2014-11:01:07

Dates et versions

hal-01006394 , version 1 (16-06-2014)

Identifiants

HAL Id : hal-01006394 , version 1
DOI : 10.1007/978-3-319-04298-5_25

Citer

Hugo Gimbert, Youssouf Oualhadj. Deciding the Value 1 Problem for sharp-acyclic Partially Observable Markov Decision Processes. SOFSEM 2014, Jan 2014, Nový Smokovec, Slovakia. pp.281-292, ⟨10.1007/978-3-319-04298-5_25⟩. ⟨hal-01006394⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS

378 Consultations

130 Téléchargements

Deciding the Value 1 Problem for sharp-acyclic Partially Observable Markov Decision Processes

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager