Q-Learning with Double Progressive Widening : Application to Robotics

Nataliya Sokolovska; Olivier Teytaud; Mario Milone

Communication Dans Un Congrès Année : 2011

Q-Learning with Double Progressive Widening : Application to Robotics

(1, 2) , (1, 2) , (1)

1
2

Nataliya Sokolovska

Fonction : Auteur
PersonId : 879120

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Olivier Teytaud

Fonction : Auteur
PersonId : 581
IdHAL : olivier-teytaud
IdRef : 05971008X

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Mario Milone

Fonction : Auteur

Laboratoire de Recherche en Informatique

Résumé

Discretization of state and action spaces is a critical issue in $Q$-Learning. In our contribution, we propose a real-time adaptation of the discretization by the progressive widening technique which has been already used in bandit-based methods. Results are consistently converging to the optimum of the problem, without changing the parametrization for each new problem.

Domaines

Apprentissage [cs.LG]

Fichier principal

ICONIP-0854.pdf (202.31 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Nataliya Sokolovska : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00624832

Soumis le : mardi 20 septembre 2011-03:23:55

Dernière modification le : lundi 12 février 2024-09:48:04

Archivage à long terme le : mardi 13 novembre 2012-14:00:51

Dates et versions

hal-00624832 , version 1 (20-09-2011)

Identifiants

HAL Id : hal-00624832 , version 1

Citer

Nataliya Sokolovska, Olivier Teytaud, Mario Milone. Q-Learning with Double Progressive Widening : Application to Robotics. ICONIP 2011, Nov 2011, China. pp.103-112. ⟨hal-00624832⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

EC-PARIS CNRS INRIA UMR8623 INRIA2 LRI-AO UNIV-PARIS-SACLAY

328 Consultations

260 Téléchargements

Q-Learning with Double Progressive Widening : Application to Robotics

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager