Deep neural networks algorithms for stochastic control problems on finite horizon: convergence analysis

Côme Huré; Huyên Pham; Achref Bachouch; Nicolas Langrené

doi:10.1137/20M1316640

Article Dans Une Revue SIAM Journal on Numerical Analysis Année : 2021

Deep neural networks algorithms for stochastic control problems on finite horizon: convergence analysis

(1, 2) , (1, 2) , (3) , (4, 5)

1
2
3
4
5

Côme Huré

Fonction : Auteur

Laboratoire de Probabilités, Statistique et Modélisation

Université Paris Diderot - Paris 7

Huyên Pham

Fonction : Auteur
PersonId : 921733

Laboratoire de Probabilités, Statistique et Modélisation

Université Paris Diderot - Paris 7

Achref Bachouch

Fonction : Auteur
PersonId : 770108
IdRef : 192243799

University of Oslo

Nicolas Langrené

Fonction : Auteur
PersonId : 171091
IdHAL : nicolas-langrene
ORCID : 0000-0001-7601-4618
IdRef : 177095121

Data61 [Canberra]

BNU HKBU United International College

Résumé

This paper develops algorithms for high-dimensional stochastic control problems based on deep learning and dynamic programming. Unlike classical approximate dynamic programming approaches, we first approximate the optimal policy by means of neural networks in the spirit of deep reinforcement learning, and then the value function by Monte Carlo regression. This is achieved in the dynamic programming recursion by performance or hybrid iteration and regress-now methods from numerical probabilities. We provide a theoretical justification of these algorithms. Consistency and rate of convergence for the control and value function estimates are analyzed and expressed in terms of the universal approximation error of the neural networks, and of the statistical error when estimating network function, leaving aside the optimization error. Numerical results on various applications are presented in a companion paper [Deep neural networks algorithms for stochastic control problems on finite horizon: numerical applications, Methodol. Comput. Appl. Probab., 24(1) 143-178 2022] and illustrate the performance of the proposed algorithms.

Mots clés

performance iteration quantization Deep learning dynamic programming regress now convergence analysis statistical risk AMS subject classifications. 65C05 90C39 93E35

Domaines

Probabilités [math.PR] Optimisation et contrôle [math.OC] Machine Learning [stat.ML]

Fichier principal

deepconsto_sinum_final.pdf (454.96 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Huyên Pham : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01949213

Soumis le : mercredi 2 décembre 2020-09:34:15

Dernière modification le : mercredi 3 avril 2024-14:10:02

Dates et versions

hal-01949213 , version 1 (09-12-2018)

hal-01949213 , version 2 (02-12-2020)

Identifiants

HAL Id : hal-01949213 , version 2
ARXIV : 1812.04300
DOI : 10.1137/20M1316640

Citer

Côme Huré, Huyên Pham, Achref Bachouch, Nicolas Langrené. Deep neural networks algorithms for stochastic control problems on finite horizon: convergence analysis. SIAM Journal on Numerical Analysis, 2021, 59 (1), pp.525-557. ⟨10.1137/20M1316640⟩. ⟨hal-01949213v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS TDS-MACS LPSM SORBONNE-UNIVERSITE SU-SCIENCES UP-SCIENCES ANR

269 Consultations

475 Téléchargements

Deep neural networks algorithms for stochastic control problems on finite horizon: convergence analysis

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager