Preference Learning on the Execution of Collaborative Human-Robot Tasks

Thibaut Munzer 1, Marc Toussaint 2, Manuel Lopes 1
1 Flowers - Flowing Epigenetic Robots and Systems
Inria Bordeaux - Sud-Ouest, U2IS - Unité d'Informatique et d'Ingénierie des Systèmes
Abstract: We present a novel method to learn human preferences during, and for, the execution of concurrent joint human-robot tasks. We consider tasks carried out by a team consisting of a human operator and a robot helper, where the robot should adapt to the human's preferences for how the task is executed. Human operators differ in abilities, experience, and personal preferences, so one allocation of activities within the team may be preferred over another. We cast concurrent multi-agent cooperation as a semi-Markov decision process and show how to model and learn human preferences over the team's behavior. We propose two interactive learning algorithms, evaluate them, and show that the system can effectively learn and adapt to human preferences.
Contributor: Thibaut Munzer
Submitted on: Tuesday, November 21, 2017 - 7:36:48 PM
Last modification on: Monday, December 17, 2018 - 10:34:42 AM





Thibaut Munzer, Marc Toussaint, Manuel Lopes. Preference Learning on the Execution of Collaborative Human-Robot Tasks. ICRA 2017 - IEEE International Conference on Robotics and Automation, May 2017, Singapore, Singapore. pp.1-7, ⟨10.1109/ICRA.2017.7989108⟩. ⟨hal-01644014⟩


