A survey of actor-critic reinforcement learning: standard and natural policy gradients - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews Année : 2012

A survey of actor-critic reinforcement learning: standard and natural policy gradients

Résumé

Policy gradient based actor-critic algorithms are amongst the most popular algorithms in the reinforcement learning framework. Their advantage of being able to do policy search using low-variance gradient estimates has made them useful in several real-life applications, such as robotics, power control and finance. Although general surveys on reinforcement learning techniques already exist, no survey is specifically dedicated to actor-critic algorithms in particular. This paper therefore describes the state of the art of actor-critic algorithms, with a focus on methods that can work in an online setting and use function approximation in order to deal with continuous state and action spaces. After starting with a discussion on the concepts of reinforcement learning and the origins of actor-critic algorithms, this paper describes the workings of the natural gradient, which has made its way into many actor-critic algorithms in the past few years. A review of several standard and natural actor-critic algorithms follows and the paper concludes with an overview of application areas and a discussion on open issues.
Fichier principal
Vignette du fichier
ivo_smcc12_survey.pdf (3.1 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00756747 , version 1 (23-11-2012)

Identifiants

Citer

Ivo Grondman, Lucian Busoniu, Gabriel Lopes, Robert Babuska. A survey of actor-critic reinforcement learning: standard and natural policy gradients. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, 2012, 42 (6), pp.1291-1307. ⟨10.1109/TSMCC.2012.2218595⟩. ⟨hal-00756747⟩
551 Consultations
6363 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More