History-dependent evaluations in POMDPs

Xavier Venel; Bruno Ziliotto

doi:10.1137/20M1332876

Journal article, SIAM Journal on Control and Optimization, Year: 2021


Xavier Venel

Role: Author
PersonId: 8219
IdHAL: xavier-venel
ORCID: 0000-0003-1150-9139
IdRef: 163942404

Centre d'économie de la Sorbonne

Paris School of Economics

Bruno Ziliotto

Role: Author

CEntre de REcherches en MAthématiques de la DEcision

Centre National de la Recherche Scientifique

Université Paris Sciences et Lettres

Abstract

We consider POMDPs in which the weight of the stage payoff depends on the past sequence of signals and actions occurring in the infinitely repeated problem. We prove that for all ε > 0, there exists a strategy that is ε-optimal for any sequence of weights satisfying a property that can be interpreted as "the decision-maker is patient enough". This unifies and generalizes several results in the literature, and applies notably to POMDPs with limsup payoffs.
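
As a rough illustration of the kind of evaluation the abstract describes, the sketch below writes a weighted payoff in LaTeX. The notation is an assumption made here and is not taken from the paper: g_m is the payoff at stage m, h_m is the past sequence of signals and actions before stage m, θ_m(h_m) ≥ 0 is the weight of stage m, and σ is a strategy. The two special cases shown are the standard history-independent n-stage average and discounted evaluations. The exact "patient enough" condition is not stated in the abstract and is left unspecified here.

```latex
% Assumed notation (not from the abstract): g_m, h_m, \theta_m, \sigma.
% Requires amsmath (for \text and \tfrac).
\[
  \gamma_\theta(\sigma) \;=\;
  \mathbb{E}_\sigma\!\left[\sum_{m \ge 1} \theta_m(h_m)\, g_m\right]
\]
% Standard history-independent special cases of the weights:
\[
  \theta_m = \tfrac{1}{n}\,\mathbf{1}_{\{m \le n\}}
  \quad \text{($n$-stage average)},
  \qquad
  \theta_m = \lambda (1-\lambda)^{m-1}
  \quad \text{($\lambda$-discounted, } 0 < \lambda \le 1\text{)}.
\]
```

In both special cases the weights sum to one, and a patient decision-maker corresponds to large n or small λ.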

Keywords

Markov decision process; partial observation; long-run average payoff

Domains

Optimization and Control [math.OC]

Contributor: Bruno Ziliotto

https://hal.science/hal-02920560

Submitted on: Monday, August 24, 2020, 17:45:49

Last modified on: Friday, April 19, 2024, 16:18:54

Dates and versions

hal-02920560, version 1 (24-08-2020)

Identifiers

HAL Id: hal-02920560, version 1
arXiv: 2004.08844
DOI: 10.1137/20M1332876

Cite

Xavier Venel, Bruno Ziliotto. History-dependent evaluations in POMDPs. SIAM Journal on Control and Optimization, 2021, 59 (2), pp. 1730-1755. ⟨10.1137/20M1332876⟩. ⟨hal-02920560⟩


Collections

UNIV-PARIS1 ENS-PARIS ENPC CNRS UNIV-DAUPHINE EHESS CES CEREMADE PARISTECH TDS-MACS PSL INRAE JSE2024
