Integrating an Observer in Interactive Reinforcement Learning to Learn Legible Trajectories - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

Integrating an Observer in Interactive Reinforcement Learning to Learn Legible Trajectories

Mohamed Chetouani

Résumé

An important aspect of Human-Robot-cooperation is that the robot is capable of clearly communicating its intentions to its human collaborator. This communication of intentions often requires the generation of legible motion trajectories. The concept of legible motion is usually not studied together with machine learning. Studying these fields together is an important step towards better Human-Robot cooperation. In this paper, we investigate interactive robot learning approaches with the aim of developing models that are able to generate legible motions by taking observer feedback into account. We explore how to integrate the observer feedback into a Reinforcement Learning (RL) framework. We do this by proposing three different observer algorithms as observer strategies in an interactive RL scheme and compare with one non-interactive RL algorithm as baseline. For the observer strategies we vary the method how the observer estimates how likely the agent is going for the target goal. We evaluate our approach on five environments and calculate the legibility of the learned trajectories. The results show that the legibility of the learned trajectories is significantly higher while integrating the feedback from the observer compared with a standard Q-Learning algorithm not using the observer feedback.
Fichier principal
Vignette du fichier
Ro_man2020(5).pdf (550.01 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02984877 , version 1 (01-11-2020)

Identifiants

Citer

Manuel Bied, Mohamed Chetouani. Integrating an Observer in Interactive Reinforcement Learning to Learn Legible Trajectories. 2020 29th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN), Aug 2020, Naples, Italy. pp.760-767, ⟨10.1109/RO-MAN47096.2020.9223338⟩. ⟨hal-02984877⟩
52 Consultations
295 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More