Simultaneous Acquisition of Task and Feedback Models

Manuel Lopes 1 Thomas Cederborg 1 Pierre-Yves Oudeyer 1
1 Flowers - Flowing Epigenetic Robots and Systems
Inria Bordeaux - Sud-Ouest, ENSTA ParisTech U2IS - Unité d'Informatique et d'Ingénierie des Systèmes
Abstract : We present a system to learn task representations from ambiguous feedback. We consider an inverse reinforcement learner that receives feedback from a teacher with an unknown and noisy protocol. The system needs to estimate simultaneously what the task is (i.e. how to find a compact representation to the task goal), and how the teacher is providing the feedback. We further explore the problem of ambiguous protocols by considering that the words used by the teacher have an unknown relation with the action and meaning expected by the robot. This allows the system to start with a set of known signs and learn the meaning of new ones. We present computational results that show that it is possible to learn the task under a noisy and ambiguous feedback. Using an active learning approach, the system is able to reduce the length of the training period.
Type de document :
Communication dans un congrès
Development and Learning (ICDL), 2011 IEEE International Conference on, 2011, Germany. pp.1 - 7, 2011, <10.1109/DEVLRN.2011.6037359>
Liste complète des métadonnées

https://hal.archives-ouvertes.fr/hal-00636166
Contributeur : Manuel Lopes <>
Soumis le : mercredi 26 octobre 2011 - 22:21:08
Dernière modification le : vendredi 6 janvier 2017 - 01:22:40
Document(s) archivé(s) le : vendredi 27 janvier 2012 - 02:36:35

Fichier

11-icdl.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Collections

Citation

Manuel Lopes, Thomas Cederborg, Pierre-Yves Oudeyer. Simultaneous Acquisition of Task and Feedback Models. Development and Learning (ICDL), 2011 IEEE International Conference on, 2011, Germany. pp.1 - 7, 2011, <10.1109/DEVLRN.2011.6037359>. <hal-00636166>

Partager

Métriques

Consultations de
la notice

260

Téléchargements du document

106