State Representation Learning from Demonstration

Astrid Merckling; Alexandre Coninx; Loic Cressot; Stephane Doncieux; Nicolas Perrin

Conference paper, Year: 2020

Astrid Merckling

Role: Author
PersonId: 747734
IdHAL: astrid-merckling
ORCID: 0000-0002-7036-6548
IdRef: 260798924

Institut des Systèmes Intelligents et de Robotique

Alexandre Coninx

Role: Author
PersonId: 184690
IdHAL: alex-coninx
ORCID: 0000-0001-7992-8183
IdRef: 166602183

Loic Cressot

Role: Author

Institut des Systèmes Intelligents et de Robotique

Stephane Doncieux

Role: Author
PersonId: 3909
IdHAL: stephane-doncieux
ORCID: 0000-0003-1541-054X
IdRef: 089428617

Nicolas Perrin

Role: Author
PersonId: 741992
IdHAL: nicolas-perrin-gilbert
ORCID: 0000-0001-8626-1938
IdRef: 158235509

Abstract

In a setting where several policies can be observed as black boxes on different instances of a control task, we propose a method to derive a state representation that can be relied on to reproduce any of the observed policies. We do so via imitation learning on a multi-head neural network: a first part outputs a common state representation, followed by one head per policy to imitate. If the demonstrations contain enough diversity, the state representation is general and can be transferred to learn new instances of the task. We present a proof of concept with experimental results on a simulated 2D robotic arm performing a reaching task, with noisy image inputs containing a distractor, and show that the learned state representations provide a greater speed-up to end-to-end reinforcement learning on new instances of the task than other classical representations do.
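The multi-head structure described in the abstract can be illustrated with a small toy sketch. This is not the authors' implementation: it assumes a linear encoder and linear heads trained with plain stochastic gradient descent on a squared imitation error, whereas the paper works on image inputs with a neural network. The class `MultiHeadImitator`, the helper `make_demos`, the two toy expert policies, and all dimensions and hyperparameters are hypothetical and chosen only to keep the example self-contained.

```python
import random

def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(m * x for m, x in zip(row, vector)) for row in matrix]

def init_matrix(rows, cols, rng, scale=0.1):
    """Small random initial weights."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]

class MultiHeadImitator:
    """Shared linear encoder followed by one linear head per demonstrated policy."""

    def __init__(self, obs_dim, state_dim, action_dim, num_policies, seed=0):
        rng = random.Random(seed)
        self.encoder = init_matrix(state_dim, obs_dim, rng)
        self.heads = [init_matrix(action_dim, state_dim, rng)
                      for _ in range(num_policies)]

    def encode(self, obs):
        # Common state representation shared by every head.
        return matvec(self.encoder, obs)

    def act(self, obs, policy_id):
        # Action predicted by the head associated with one observed policy.
        return matvec(self.heads[policy_id], self.encode(obs))

    def train_step(self, obs, target_action, policy_id, lr=0.05):
        """One SGD step on 0.5 * ||predicted action - demonstrated action||^2."""
        state = self.encode(obs)
        head = self.heads[policy_id]
        err = [p - t for p, t in zip(matvec(head, state), target_action)]
        # Gradient w.r.t. the state, computed before the head is modified.
        grad_state = [sum(head[i][j] * err[i] for i in range(len(err)))
                      for j in range(len(state))]
        for i, e in enumerate(err):          # update only the selected head
            for j, s in enumerate(state):
                head[i][j] -= lr * e * s
        for j, g in enumerate(grad_state):   # update the shared encoder
            for k, o in enumerate(obs):
                self.encoder[j][k] -= lr * g * o

def make_demos(rng, n):
    """Toy demonstrations: (observation, expert action, policy index).
    The third observation component is a distractor ignored by both experts."""
    experts = [lambda x, y: [x + y], lambda x, y: [x - y]]
    demos = []
    for _ in range(n):
        x, y, d = (rng.uniform(-1, 1) for _ in range(3))
        k = rng.randrange(len(experts))
        demos.append(([x, y, d], experts[k](x, y), k))
    return demos

def mean_loss(model, demos):
    total = 0.0
    for obs, action, k in demos:
        pred = model.act(obs, k)
        total += 0.5 * sum((p - t) ** 2 for p, t in zip(pred, action))
    return total / len(demos)

demos = make_demos(random.Random(1), 200)
model = MultiHeadImitator(obs_dim=3, state_dim=2, action_dim=1, num_policies=2)
initial_loss = mean_loss(model, demos)
for _ in range(30):
    for obs, action, k in demos:
        model.train_step(obs, action, k)
final_loss = mean_loss(model, demos)
print(initial_loss, final_loss)
```

In this sketch every demonstration updates the shared encoder but only the head of the policy it came from, so the encoder is pushed toward a representation that serves all heads. Transferring to a new task instance would then amount to reusing `model.encode` and learning a new policy on top of it.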

Domains

Robotics [cs.RO]; Machine Learning [cs.LG]; Artificial Intelligence [cs.AI]

Contributor: Nicolas Perrin-Gilbert

https://hal.science/hal-03083156

Submitted on: Friday, 18 December 2020, 21:03:53

Last modified on: Saturday, 7 October 2023, 21:36:23

Dates and versions

hal-03083156, version 1 (18-12-2020)

Identifiers

HAL Id: hal-03083156, version 1
arXiv: 1910.01738

Cite

Astrid Merckling, Alexandre Coninx, Loic Cressot, Stephane Doncieux, Nicolas Perrin. State Representation Learning from Demonstration. 6th International Conference on Machine Learning, Optimization, and Data Science, LOD 2020, Jul 2020, Siena, Italy. ⟨hal-03083156⟩

Collections

CNRS ISIR SORBONNE-UNIVERSITE SU-SCIENCES ANR ISIR_AMAC ISIR_SYROCO

75 views

0 downloads
