Bootstrapping Q-Learning for Robotics from Neuro-Evolution Results

Matthieu Zimmer; Stephane Doncieux

doi:10.1109/TCDS.2016.2628817

Journal article, IEEE Transactions on Cognitive and Developmental Systems, Year: 2017

Matthieu Zimmer

Role: Author
PersonId : 9288
IdHAL : matthieu-zimmer
ORCID : 0000-0002-8029-308X

Architectures et modèles d'Adaptation et de la cognition

Stephane Doncieux

Role: Author
PersonId : 3909
IdHAL : stephane-doncieux
ORCID : 0000-0003-1541-054X
IdRef : 089428617

Architectures et modèles d'Adaptation et de la cognition

Abstract

Reinforcement learning problems are hard to solve in a robotics context because classical algorithms rely on discrete representations of actions and states, whereas in robotics both are continuous. A discrete set of actions and states can be defined, but doing so requires expertise that may not be available, particularly in open environments. This work proposes a process by which a robot builds its own representation for a reinforcement learning algorithm. The principle is to first use a direct policy search in the sensorimotor space, i.e. with no predefined discrete sets of states or actions, and then to extract discrete actions from the corresponding learning traces and identify the relevant dimensions of the state for estimating the value function. Once this is done, the robot can apply reinforcement learning (1) to be more robust to new domains and, if required, (2) to learn faster than a direct policy search. This approach takes the best of both worlds: it first learns in a continuous space, which avoids the need for a specific representation but comes at the price of a long learning process and poor generalization, and then learns with an adapted representation to be faster and more robust.
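
The two-stage idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how discrete actions are extracted, so plain 1-D k-means is used here as a stand-in, the step that identifies relevant state dimensions is omitted, and the corridor environment, the `trace` values, and the function names `extract_discrete_actions` and `q_learning` are all hypothetical.

```python
import random


def extract_discrete_actions(trace_actions, k, iters=20, seed=0):
    """Cluster 1-D continuous actions from learning traces with plain k-means.

    Assumption: k-means stands in for the extraction step, which the
    abstract does not specify.
    """
    rng = random.Random(seed)
    centers = rng.sample(trace_actions, k)
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for a in trace_actions:
            nearest = min(range(k), key=lambda i: abs(a - centers[i]))
            groups[nearest].append(a)
        # An empty cluster keeps its previous center.
        centers = [sum(g) / len(g) if g else c for g, c in zip(groups, centers)]
    return sorted(centers)


def q_learning(actions, n_states=6, episodes=300, max_steps=100,
               alpha=0.5, gamma=0.95, seed=0):
    """Tabular Q-learning on a toy corridor; the goal is the last state."""
    rng = random.Random(seed)
    goal = n_states - 1
    q = [[0.0] * len(actions) for _ in range(n_states)]
    for _ in range(episodes):
        s = 0
        for _ in range(max_steps):
            # Q-learning is off-policy, so uniform random exploration suffices here.
            a = rng.randrange(len(actions))
            # Each discrete action is a displacement, rounded to whole cells.
            s2 = min(max(s + round(actions[a]), 0), goal)
            reward = 1.0 if s2 == goal else 0.0
            bootstrap = 0.0 if s2 == goal else gamma * max(q[s2])
            q[s][a] += alpha * (reward + bootstrap - q[s][a])
            s = s2
            if s == goal:
                break
    return q


# Hypothetical action trace, as if logged from a continuous policy.
trace = [-1.1, -0.9, -1.0, 1.0, 0.9, 1.1]
discrete_actions = extract_discrete_actions(trace, k=2)
q_table = q_learning(discrete_actions)
# Index of the best discrete action in every non-goal state.
greedy = [max(range(len(discrete_actions)), key=lambda i: row[i])
          for row in q_table[:-1]]
```

Here the continuous trace collapses into two discrete actions (roughly "step left" and "step right"), and Q-learning then operates only on that small action set, which is the kind of representation the abstract describes the robot building for itself.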

Keywords

generation of representation during development; transfer learning; robots with development and learning skills

Domains

Artificial intelligence [cs.AI]

Main file

article.pdf (2.38 MB)

Origin: Files produced by the author(s)

https://hal.science/hal-01494744

Submitted on: Thursday, 23 March 2017, 21:12:51

Last modified on: Thursday, 28 March 2024, 13:50:04

Long-term archiving on: Saturday, 24 June 2017, 16:24:13

Dates and versions

hal-01494744 , version 1 (23-03-2017)

Identifiers

HAL Id : hal-01494744 , version 1
DOI : 10.1109/TCDS.2016.2628817

Cite

Matthieu Zimmer, Stephane Doncieux. Bootstrapping Q-Learning for Robotics from Neuro-Evolution Results. IEEE Transactions on Cognitive and Developmental Systems, 2017, ⟨10.1109/TCDS.2016.2628817⟩. ⟨hal-01494744⟩

Collections

CNRS ISIR GRID5000 SORBONNE-UNIVERSITE SU-SCIENCES SILECS ISIR_AMAC

317 Views

452 Downloads
