General limit value in Dynamic Programming

Jérôme Renault

Pré-Publication, Document De Travail Année : 2013

General limit value in Dynamic Programming

(1)

Jérôme Renault

Fonction : Auteur
PersonId : 21086
IdHAL : jerome-renault
ORCID : 0000-0003-2220-3143
IdRef : 112479952

Groupe de recherche en économie mathématique et quantitative

Résumé

We consider a dynamic programming problem with arbitrary state space and bounded rewards. Is it possible to define in an unique way a limit value for the problem, where the ''patience" of the decision-maker tends to infinity ? We consider, for each evaluation $\theta$ (a probability distribution over positive integers) the value function $v_{\theta}$ of the problem where the weight of any stage $t$ is given by $\theta_t$, and we investigate the uniform convergence of a sequence $(v_{\theta^k})_k$ when the ''impatience" of the evaluations vanishes, in the sense that $\sum_{t} | \theta^k_{t}-\theta^k_{t+1}| \rightarrow_{k \to \infty} 0$. We prove that this uniform convergence happens if and only if the metric space $\{v_{\theta^k}, k\geq 1\}$ is totally bounded. Moreover there exists a particular function $v^*$, independent of the particular chosen sequence $({\theta^k})_k$, such that any limit point of such sequence of value functions is precisely $v^*$. Consequently, while speaking of uniform convergence of the value functions, $v^*$ may be considered as the unique possible limit when the patience of the decision-maker tends to infinity. The result applies in particular to discounted payoffs when the discount factor vanishes, as well as to average payoffs where the number of stages goes to infinity, and also to models with stochastic transitions. We present tractable corollaries, and we discuss counterexamples and a conjecture.

Mots clés

dynamic programming average payoffs discounted payoffs general evaluations limit value vanishing impatience uniform convergence of the values

Domaines

Optimisation et contrôle [math.OC]

Fichier principal

generalimitvalues2.pdf (157.6 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Jérôme Renault : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00769763

Soumis le : jeudi 3 janvier 2013-11:47:23

Dernière modification le : mardi 19 mars 2024-03:10:42

Archivage à long terme le : jeudi 4 avril 2013-03:48:11

Dates et versions

hal-00769763 , version 1 (03-01-2013)

Identifiants

HAL Id : hal-00769763 , version 1
ARXIV : 1301.0451

Citer

Jérôme Renault. General limit value in Dynamic Programming. 2013. ⟨hal-00769763⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS EHESS INRA UT1-CAPITOLE TDS-MACS INRAE

126 Consultations

76 Téléchargements

General limit value in Dynamic Programming

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager