Average control of Markov decision processes with Feller transition probabilities and general action spaces

Oswaldo Costa; François Dufour

doi:10.1016/j.jmaa.2012.05.073

Journal article. Journal of Mathematical Analysis and Applications. Year: 2012


Oswaldo Costa

Role: Author

Universidade de São Paulo = University of São Paulo

François Dufour

Role: Author
PersonId: 12044
IdHAL: francois-dufour
ORCID: 0000-0001-6653-2024
IdRef: 127261680

Institut de Mathématiques de Bordeaux

Quality control and dynamic reliability

Abstract

This paper studies the average control problem for discrete-time Markov decision processes (MDPs) with a general state space, Feller transition probabilities, and possibly non-compact control constraint sets $A(x)$. Two hypotheses are considered: either the cost function $c$ is strictly unbounded, or the multifunctions $A_r(x) = \{a \in A(x) : c(x, a) \le r\}$ are upper-semicontinuous and compact-valued for each real $r$. For these two cases we provide new results on the existence of a solution to the average-cost optimality equality and inequality using the vanishing discount approach. We also study the convergence of the policy iteration approach under these conditions. It should be pointed out that we do not make any assumptions regarding the convergence and the continuity of the limit function generated by the sequence of relative differences of the $\alpha$-discounted value functions and the Poisson equations, assumptions that are often encountered in the literature.

Domains

Probability [math.PR]

Contributor: Benoîte de Saporta

https://hal.science/hal-00938889

Submitted on: Wednesday, 29 January 2014, 16:49:07

Last modified on: Thursday, 4 April 2024, 03:07:19

Dates and versions

hal-00938889, version 1 (29-01-2014)

Identifiers

HAL Id: hal-00938889, version 1
DOI: 10.1016/j.jmaa.2012.05.073

Cite

Oswaldo Costa, François Dufour. Average control of Markov decision processes with Feller transition probabilities and general action spaces. Journal of Mathematical Analysis and Applications, 2012, 396 (1), pp.58-69. ⟨10.1016/j.jmaa.2012.05.073⟩. ⟨hal-00938889⟩


Collections

CNRS INRIA IMB INRIA2
