Quantitative propagation of chaos for mean field Markov decision process with common noise

Médéric Motte; Huyên Pham

Pré-Publication, Document De Travail Année : 2022

Quantitative propagation of chaos for mean field Markov decision process with common noise

(1) , (2, 3)

1
2
3

Médéric Motte

Fonction : Auteur

Laboratoire de Probabilités, Statistique et Modélisation

Huyên Pham

Fonction : Auteur
PersonId : 1057530

Laboratoire de Probabilités, Statistique et Modélisation

UFR Mathématiques [Sciences] - Université Paris Cité

Résumé

We investigate propagation of chaos for mean field Markov Decision Process with common noise (CMKV-MDP), and when the optimization is performed over randomized open-loop controls on infinite horizon. We first state a rate of convergence of order M_N^\gamma , where M_N is the mean rate of convergence in Wasserstein distance of the empirical measure, and γ \in (0,1] 1s is an explicit constant, in the limit of the value functions of N-agent control problem with asymmetric open-loop controls, towards the value function of CMKV-MDP. Furthermore, we show how to explicitly construct O(\epsilon + M_N^\gamma)-optimal policies for the N-agent model from \epsilon-optimal policies for the CMKV-MDP. Our approach relies on sharp comparison between the Bellman operators in the N-agent problem and the CMKV-MDP, and fine coupling of empirical measures.

Domaines

Optimisation et contrôle [math.OC] Probabilités [math.PR]

Fichier principal

Conv-MPjuly2022.pdf (344.39 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Huyên Pham : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03737655

Soumis le : lundi 25 juillet 2022-11:56:38

Dernière modification le : mercredi 3 avril 2024-14:10:02

Dates et versions

hal-03737655 , version 1 (25-07-2022)

Identifiants

HAL Id : hal-03737655 , version 1
ARXIV : 2207.12738

Citer

Médéric Motte, Huyên Pham. Quantitative propagation of chaos for mean field Markov decision process with common noise. 2022. ⟨hal-03737655⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS TDS-MACS LPSM SORBONNE-UNIVERSITE SU-SCIENCES UP-SCIENCES

27 Consultations

19 Téléchargements

Quantitative propagation of chaos for mean field Markov decision process with common noise

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager