Conference paper, Year: 2014

Reachability in MDPs: Refining Convergence of Value Iteration

Abstract

Markov Decision Processes (MDPs) are a widely used model that includes both non-deterministic and probabilistic choices. The minimal and maximal probabilities of reaching a target set of states, with respect to a policy resolving non-determinism, may be computed by several methods, including value iteration. This algorithm, which is easy to implement and efficient in terms of space complexity, iteratively computes the probabilities of paths of increasing length. However, it raises three issues: (1) defining a stopping criterion that ensures a bound on the approximation, (2) analyzing the rate of convergence, and (3) specifying an additional procedure to obtain the exact values once a sufficient number of iterations has been performed. The first two issues are still open, and for the third one a "crude" upper bound on the number of iterations has been proposed. Based on a graph analysis and transformation of MDPs, we address these problems. First, we introduce an interval iteration algorithm, for which the stopping criterion is straightforward. Then we exhibit its convergence rate. Finally, we significantly improve the bound on the number of iterations required to get the exact values.
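The interval iteration idea mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `interval_iteration`, the dictionary encoding of the MDP, and the four-state example are all hypothetical. The sketch also assumes that the MDP has already been preprocessed so that its only end components are the absorbing `goal` and `sink` states; without such a transformation the upper sequence may fail to converge to the true value.

```python
from typing import Dict, List, Tuple

# Hypothetical encoding: mdp[state] is a list of actions, and each action
# is a list of (successor, probability) pairs summing to 1.
MDP = Dict[str, List[List[Tuple[str, float]]]]


def interval_iteration(mdp: MDP, target: str, sink: str,
                       eps: float = 1e-6, maximize: bool = True):
    """Iterate a lower and an upper bound on the reachability probability.

    Assumption: `target` and `sink` are the only end components of `mdp`.
    Stops as soon as the two bounds differ by at most `eps` in every state.
    """
    opt = max if maximize else min
    # Lower bound starts at 0 everywhere except the target.
    lower = {s: 1.0 if s == target else 0.0 for s in mdp}
    # Upper bound starts at 1 everywhere except the sink.
    upper = {s: 0.0 if s == sink else 1.0 for s in mdp}

    def step(x: Dict[str, float]) -> Dict[str, float]:
        # One synchronous Bellman update; absorbing states keep their value.
        return {
            s: x[s] if s in (target, sink)
            else opt(sum(p * x[t] for t, p in action) for action in mdp[s])
            for s in mdp
        }

    iterations = 0
    while max(upper[s] - lower[s] for s in mdp) > eps:
        lower, upper = step(lower), step(upper)
        iterations += 1
    return lower, upper, iterations


# Hypothetical example: from s0, either gamble directly or move to s1.
example: MDP = {
    "s0": [[("goal", 0.5), ("sink", 0.5)], [("s1", 1.0)]],
    "s1": [[("s0", 0.5), ("goal", 0.3), ("sink", 0.2)]],
    "goal": [[("goal", 1.0)]],
    "sink": [[("sink", 1.0)]],
}

lo_max, up_max, n_max = interval_iteration(example, "goal", "sink")
lo_min, up_min, n_min = interval_iteration(example, "goal", "sink", maximize=False)
```

In this example the exact maximal probability from `s0` is 0.6 and the minimal one is 0.5; on termination the true value of every state lies between the returned lower and upper bounds, so the stopping test directly bounds the approximation error.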
Main file
value-iteration.pdf (384.98 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01091122, version 1 (04-12-2014)
hal-01091122, version 2 (12-02-2016)

Licence

Copyright (All rights reserved)

Cite

Serge Haddad, Benjamin Monmege. Reachability in MDPs: Refining Convergence of Value Iteration. 8th International Workshop on Reachability Problems (RP'14), Sep 2014, Oxford, United Kingdom. pp.125-137, ⟨10.1007/978-3-319-11439-2_10⟩. ⟨hal-01091122v2⟩
