Q-Learning with Double Progressive Widening : Application to Robotics - HAL open archive
Conference paper, Year: 2011

Q-Learning with Double Progressive Widening : Application to Robotics

Abstract

Discretization of the state and action spaces is a critical issue in $Q$-Learning. In our contribution, we propose a real-time adaptation of the discretization by the progressive widening technique, which has already been used in bandit-based methods. Results consistently converge to the optimum of the problem, without changing the parametrization for each new problem.
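
The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea, assuming the usual progressive-widening rule in which a state visited n times may hold at most ceil(C * n^alpha) candidate actions. The names `widening_budget`, `PWQLearner`, `sample_action`, and the constants `c` and `alpha` are hypothetical and not taken from the paper. Only the action side is widened here; the state is assumed to be already discrete (hashable), whereas the "double" variant would apply an analogous rule to states.

```python
import math
import random
from collections import defaultdict


def widening_budget(visits, c=1.0, alpha=0.5):
    """Candidates allowed after `visits` visits: ceil(c * visits**alpha), at least 1."""
    return max(1, math.ceil(c * visits ** alpha))


class PWQLearner:
    """Tabular Q-learning whose per-state action set grows with the visit count.

    Illustrative assumption, not the authors' implementation.
    """

    def __init__(self, sample_action, lr=0.1, gamma=0.95, epsilon=0.1, seed=0):
        self.sample_action = sample_action  # hypothetical: rng -> new continuous action
        self.lr, self.gamma, self.epsilon = lr, gamma, epsilon
        self.rng = random.Random(seed)
        self.visits = defaultdict(int)  # state -> number of visits
        self.q = defaultdict(dict)      # state -> {action: Q-value}

    def act(self, state):
        self.visits[state] += 1
        actions = self.q[state]
        # Widening step: admit a new candidate action while under budget.
        if len(actions) < widening_budget(self.visits[state]):
            action = self.sample_action(self.rng)
            actions.setdefault(action, 0.0)
            return action
        # Otherwise choose among existing candidates (epsilon-greedy).
        if self.rng.random() < self.epsilon:
            return self.rng.choice(list(actions))
        return max(actions, key=actions.get)

    def update(self, state, action, reward, next_state):
        # Standard Q-learning target, restricted to the actions tried so far.
        next_q = self.q[next_state]
        best_next = max(next_q.values()) if next_q else 0.0
        target = reward + self.gamma * best_next
        self.q[state][action] += self.lr * (target - self.q[state][action])


# Usage on a toy one-state problem: reward peaks at action 0.7.
learner = PWQLearner(sample_action=lambda rng: rng.random())
for _ in range(100):
    a = learner.act("s")
    learner.update("s", a, -(a - 0.7) ** 2, "s")
```

With these assumed defaults (`c=1.0`, `alpha=0.5`), the number of candidate actions grows roughly as the square root of the visit count, so the discretization is refined only where the learner spends time.
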
Main file
ICONIP-0854.pdf (202.31 KB)
Origin: files produced by the author(s)

Dates and versions

hal-00624832, version 1 (20-09-2011)

Identifiers

  • HAL Id: hal-00624832, version 1

Cite

Nataliya Sokolovska, Olivier Teytaud, Mario Milone. Q-Learning with Double Progressive Widening : Application to Robotics. ICONIP 2011, Nov 2011, China. pp.103-112. ⟨hal-00624832⟩
