Limits of Multi-Discounted Markov Decision Processes - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2007

Limits of Multi-Discounted Markov Decision Processes

Résumé

Markov decision processes (MDPs) are controllable discrete event systems with stochastic transitions. The payoff received by the controller can be evaluated in different ways, depending on the payoff function the MDP is equipped with. For example a \emph{mean--payoff} function evaluates average performance, whereas a \emph{discounted} payoff function gives more weights to earlier performance by means of a discount factor. Another well--known example is the \emph{parity} payoff function which is used to encode logical specifications~\cite{dagstuhl}. Surprisingly, parity and mean--payoff MDPs share two non--trivial properties: they both have pure stationary optimal strategies~\cite{CourYan:1990,neyman} and they both are approximable by discounted MDPs with multiple discount factors (multi--discounted MDPs)~\cite{dealf:2003,neyman}. In this paper we unify and generalize these results. We introduce a new class of payoff functions called the priority weighted payoff functions, which are generalization of both parity and mean--payoff functions. We prove that priority weighted MDPs admit optimal strategies that are pure and stationary, and that the priority weighted value of an MDP is the limit of the multi--discounted value when discount factors tend to $0$ simultaneously at various speeds.
Fichier principal
Vignette du fichier
Limits_of_Multidiscounted_MDPs_Gimbert_Zielonka.pdf (312.08 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00140148 , version 1 (05-04-2007)

Identifiants

Citer

Hugo Gimbert, Wieslaw Zielonka. Limits of Multi-Discounted Markov Decision Processes. LICS 07, Jul 2007, Wroclaw, Poland. pp.89-98, ⟨10.1109/LICS.2007.28⟩. ⟨hal-00140148⟩
83 Consultations
130 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More