
Adaptive reinforcement learning with active state-specific exploration for engagement maximization during simulated child-robot interaction

Abstract: Using assistive robots for educational applications requires robots to be able to adapt their behavior specifically to each child with whom they interact. Among relevant signals, non-verbal cues such as the child's gaze can provide the robot with important information about the child's current engagement in the task, and whether the robot should continue its current behavior or not. Here we propose a reinforcement learning algorithm extended with active state-specific exploration and show its applicability to child engagement maximization as well as to more classical tasks such as maze navigation. We first demonstrate its adaptive nature on a continuous maze problem, an enhancement of the classic grid world. There, parameterized actions enable the agent to learn single moves that reach the end of a corridor, similarly to "options" but without explicit hierarchical representations. We then apply the algorithm to a series of simulated scenarios, such as an extended Tower of Hanoi where the robot should find the speed of movement appropriate to the interacting child, and a pointing task where the robot should find the child-specific appropriate level of action expressivity. We show that the algorithm copes with both global and local non-stationarities in the state space while preserving stable behavior in its stationary portions. Altogether, these results suggest a promising way to enable robot learning based on non-verbal cues despite the high degree of non-stationarity that can occur during interaction with children.
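
The update equations are not reproduced on this page; as a rough illustration of the idea of state-specific exploration described in the abstract, the sketch below assumes a tabular Q-learning agent whose softmax inverse temperature is adjusted per state from a running average of absolute reward prediction errors. The class name, the tanh mapping, and all parameter values are hypothetical and not taken from the paper.

```python
import numpy as np


class StateSpecificExplorationQLearner:
    """Illustrative tabular Q-learner with a per-state exploration temperature.

    States with large recent reward prediction errors (local non-stationarity)
    get a lower inverse temperature, hence more exploration, while stationary
    states keep a high inverse temperature and continue to be exploited.
    """

    def __init__(self, n_states, n_actions, alpha=0.1, gamma=0.95,
                 beta_min=1.0, beta_max=10.0, meta_rate=0.05):
        self.Q = np.zeros((n_states, n_actions))
        self.alpha = alpha                       # learning rate
        self.gamma = gamma                       # discount factor
        self.beta = np.full(n_states, beta_max)  # per-state inverse temperature
        self.surprise = np.zeros(n_states)       # running average of |RPE| per state
        self.beta_min, self.beta_max = beta_min, beta_max
        self.meta_rate = meta_rate

    def act(self, s, rng=np.random):
        # Softmax action selection with the state-specific inverse temperature.
        prefs = self.beta[s] * self.Q[s]
        prefs -= prefs.max()                     # numerical stability
        probs = np.exp(prefs) / np.exp(prefs).sum()
        return rng.choice(len(probs), p=probs)

    def update(self, s, a, r, s_next):
        # Standard Q-learning update.
        rpe = r + self.gamma * self.Q[s_next].max() - self.Q[s, a]
        self.Q[s, a] += self.alpha * rpe
        # Meta-learning step: track how "surprising" this state currently is,
        # then map surprise to exploration (more surprise -> lower beta).
        self.surprise[s] += self.meta_rate * (abs(rpe) - self.surprise[s])
        self.beta[s] = self.beta_max - (self.beta_max - self.beta_min) * np.tanh(self.surprise[s])
```

Under these assumptions, when reward contingencies change only in part of the state space, the agent re-explores around the affected states while leaving its policy essentially untouched elsewhere, which is the qualitative behavior the abstract reports.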
Document type: Journal articles

Cited literature: 32 references

https://hal.archives-ouvertes.fr/hal-02324073
Contributor: Mehdi Khamassi
Submitted on: Monday, October 21, 2019 - 6:46:35 PM
Last modification on: Wednesday, May 19, 2021 - 11:58:12 AM
Long-term archiving on: Wednesday, January 22, 2020 - 7:16:33 PM

File

Velentzas2018_Paladyn.pdf
Publication funded by an institution

Licence


Distributed under a Creative Commons Attribution - NoDerivatives 4.0 International License

Identifiers

HAL Id: hal-02324073
DOI: 10.1515/pjbr-2018-0016

Citation

George Velentzas, Theodore Tsitsimis, Iñaki Rañó, Costas Tzafestas, Mehdi Khamassi. Adaptive reinforcement learning with active state-specific exploration for engagement maximization during simulated child-robot interaction. Paladyn: Journal of Behavioral Robotics, De Gruyter, 2018, 9 (1), pp.235-253. ⟨10.1515/pjbr-2018-0016⟩. ⟨hal-02324073⟩
