Robot Fast Adaptation to Changes in Human Engagement During Simulated Dynamic Social Interaction With Active Exploration in Parameterized Reinforcement Learning

Mehdi Khamassi; George Velentzas; Theodore Tsitsimis; Costas Tzafestas

doi:10.1109/TCDS.2018.2843122

Journal article, IEEE Transactions on Cognitive and Developmental Systems, Year: 2018


Mehdi Khamassi

Role: Author
PersonId : 186
IdHAL : mehdi-khamassi
ORCID : 0000-0002-2515-1046
IdRef : 12845072X

Institut des Systèmes Intelligents et de Robotique

George Velentzas

Role: Author

School of Electrical and Computer Engineering [Athens]

Theodore Tsitsimis

Role: Author

School of Electrical and Computer Engineering [Athens]

Costas Tzafestas

Role: Author

School of Electrical and Computer Engineering [Athens]

Abstract

Dynamic uncontrolled human-robot interactions (HRIs) require robots to be able to adapt to changes in the human's behavior and intentions. Among relevant signals, nonverbal cues such as the human's gaze can provide the robot with important information about the human's current engagement in the task, and whether the robot should continue its current behavior or not. However, robot reinforcement learning (RL) abilities to adapt to these nonverbal cues are still underdeveloped. Here, we propose an active exploration algorithm for RL during HRI where the reward function is the weighted sum of the human's current engagement and variations of this engagement. We use a parameterized action space where a meta-learning algorithm is applied to simultaneously tune the exploration in discrete action space (e.g., moving an object) and in the space of continuous characteristics of movement (e.g., velocity, direction, strength, and expressivity). We first show that this algorithm reaches state-of-the-art performance in the nonstationary multiarmed bandit paradigm. We then apply it to a simulated HRI task, and show that it outperforms continuous parameterized RL with either passive or active exploration based on different existing methods. We finally test the performance in a more realistic version of the same HRI task, where a practical approach is followed to estimate human engagement through visual cues of the head pose. The algorithm can detect and adapt to perturbations in human engagement with different durations. Altogether, these results suggest a novel, efficient, and robust framework for robot learning during dynamic HRI scenarios.
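The two ideas named in the abstract, an engagement-based reward and exploration that is tuned jointly over discrete actions and their continuous parameters, can be illustrated with a minimal Python sketch. This is not the authors' implementation. The abstract does not give the weighting scheme or the meta-learning rule, so everything here is an assumption for illustration: the names `engagement_reward`, `softmax_choice`, and `AdaptiveExplorationAgent`, the single `weight` parameter, and the rule that compares a fast and a slow running average of reward to set both the softmax inverse temperature and the Gaussian width.

```python
import math
import random


def engagement_reward(engagement, previous_engagement, weight=0.5):
    """Weighted sum of current engagement and its change since the last step.

    The single `weight` and this exact form are assumptions, not the paper's.
    """
    variation = engagement - previous_engagement
    return weight * engagement + (1.0 - weight) * variation


def softmax_choice(values, inverse_temperature, rng):
    """Sample an index with probability proportional to exp(beta * value)."""
    top = max(values)  # subtract the max for numerical stability
    weights = [math.exp(inverse_temperature * (v - top)) for v in values]
    threshold = rng.random() * sum(weights)
    cumulative = 0.0
    for index, w in enumerate(weights):
        cumulative += w
        if threshold < cumulative:
            return index
    return len(values) - 1


class AdaptiveExplorationAgent:
    """Hypothetical agent with one continuous parameter per discrete action."""

    def __init__(self, n_actions, seed=0):
        self.rng = random.Random(seed)
        self.values = [0.0] * n_actions  # value estimate per discrete action
        self.params = [0.0] * n_actions  # preferred continuous parameter per action
        self.short_avg = 0.0             # fast running average of reward
        self.long_avg = 0.0              # slow running average of reward

    def exploration_levels(self):
        # Assumed rule: when recent reward falls below the long-run level,
        # lower the inverse temperature and widen the Gaussian (explore more).
        trend = self.short_avg - self.long_avg
        inverse_temperature = max(0.1, 5.0 + 20.0 * trend)
        sigma = max(0.05, 0.5 - 2.0 * trend)
        return inverse_temperature, sigma

    def act(self):
        beta, sigma = self.exploration_levels()
        action = softmax_choice(self.values, beta, self.rng)
        parameter = self.rng.gauss(self.params[action], sigma)
        return action, parameter

    def learn(self, action, parameter, reward, learning_rate=0.2):
        error = reward - self.values[action]
        self.values[action] += learning_rate * error
        if error > 0:  # shift the parameter toward samples that beat expectation
            self.params[action] += learning_rate * (parameter - self.params[action])
        self.short_avg += 0.2 * (reward - self.short_avg)
        self.long_avg += 0.02 * (reward - self.long_avg)
```

In this sketch, a sudden drop in engagement lowers the fast average below the slow one, which flattens the action distribution and widens the parameter noise at the same time. The constants (5.0, 20.0, 0.5, 2.0 and the two averaging rates) are arbitrary placeholders and would need tuning for any real task.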

Domains

Neuroscience [q-bio.NC]

Main file

Khamassi2018_IEEE-TCDS_08404000.pdf (2 MB)

Origin: Files produced by the author(s)


https://hal.science/hal-02324064

Submitted on: Monday, 21 October 2019, 18:44:50

Last modified on: Thursday, 1 February 2024, 14:20:57

Long-term archiving on: Wednesday, 22 January 2020, 18:50:30

Dates and versions

hal-02324064 , version 1 (21-10-2019)

License

Attribution

Identifiers

HAL Id : hal-02324064 , version 1
DOI : 10.1109/TCDS.2018.2843122

Cite

Mehdi Khamassi, George Velentzas, Theodore Tsitsimis, Costas Tzafestas. Robot Fast Adaptation to Changes in Human Engagement During Simulated Dynamic Social Interaction With Active Exploration in Parameterized Reinforcement Learning. IEEE Transactions on Cognitive and Developmental Systems, 2018, 10 (4), pp.881-893. ⟨10.1109/TCDS.2018.2843122⟩. ⟨hal-02324064⟩


Collections

CNRS ISIR SORBONNE-UNIVERSITE SU-SCIENCES ANR ISIR_AMAC

22 views

79 downloads
