Optimistic planning for sparsely stochastic systems

Lucian Busoniu; Rémi Munos; Bart de Schutter; Robert Babuska

Communication Dans Un Congrès Année : 2011

Optimistic planning for sparsely stochastic systems

(1) , (2) , (1) , (1)

1
2

Lucian Busoniu

Fonction : Auteur
PersonId : 933138

Delft Center for Systems and Control [Delft]

Rémi Munos

Fonction : Auteur
PersonId : 836863

Sequential Learning

Bart de Schutter

Fonction : Auteur

Delft Center for Systems and Control [Delft]

Robert Babuska

Fonction : Auteur

Delft Center for Systems and Control [Delft]

Résumé

We propose an online planning algorithm for finite action, sparsely stochastic Markov decision processes, in which the random state transitions can only end up in a small number of possible next states. The algorithm builds a planning tree by iteratively expanding states, where each expansion exploits sparsity to add all possible successor states. Each state to expand is actively chosen to improve the knowledge about action quality, and this allows the algorithm to return a good action after a strictly limited number of expansions. More specifically, the active selection method is optimistic in that it chooses the most promising states first, so the novel algorithm is called optimistic planning for sparsely stochastic systems. We note that the new algorithm can also be seen as model-predictive (receding-horizon) control. The algorithm obtains promising numerical results, including the successful online control of a simulated HIV infection with stochastic drug effectiveness.

Domaines

Apprentissage [cs.LG]

Fichier principal

adprl2011.pdf (366.49 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Rémi Munos : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00830125

Soumis le : mardi 4 juin 2013-15:49:46

Dernière modification le : mardi 26 mars 2024-17:44:13

Archivage à long terme le : jeudi 5 septembre 2013-04:22:29

Dates et versions

hal-00830125 , version 1 (04-06-2013)

Identifiants

HAL Id : hal-00830125 , version 1

Citer

Lucian Busoniu, Rémi Munos, Bart de Schutter, Robert Babuska. Optimistic planning for sparsely stochastic systems. IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2011, paris, France. pp.48-55. ⟨hal-00830125⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-LILLE3 CNRS INRIA LAGIS INRIA2

100 Consultations

158 Téléchargements

Optimistic planning for sparsely stochastic systems

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager