Adaptive and Personalised Robots - Learning from Users' Feedback

Abir-Beatrice Karami; Karim Sehaba; Benoit Encelle

doi:10.1109/ICTAI.2013.98

Communication Dans Un Congrès Année : 2013

Adaptive and Personalised Robots - Learning from Users' Feedback

(1) , (1) , (1)

Abir-Beatrice Karami

Fonction : Auteur correspondant
PersonId : 5643
IdHAL : abir-b-karami
ORCID : 0000-0003-1972-5629
IdRef : 160681960

Connectez-vous pour contacter l'auteur

Supporting Interaction and Learning by Experience

Karim Sehaba

Fonction : Auteur
PersonId : 5239
IdHAL : karim-sehaba
IdRef : 111143624

Supporting Interaction and Learning by Experience

Benoit Encelle

Fonction : Auteur
PersonId : 7106
IdHAL : benoit-encelle
ORCID : 0000-0002-0734-6480
IdRef : 103924787

Supporting Interaction and Learning by Experience

Résumé

Service robots have become increasingly important subjects in our lives. However, they are still facing problems like adaptability to their users. While major work has focused on intelligent service robots, the proposed approaches were mostly user independent. Our work is part of the FUI-RoboPopuli project, which concentrates on endowing entertainment companion robots with adaptive and social behaviour. In particular, we are interested in robots that are able to learn and plan so that they adapt and personalize their behaviour according to their users. Markov Decision Processes (MDPs) are largely used for adaptive robots applications. However, one challenging point is reducing the sample complexity required to learn an MDP model, including the reward function. In this article, we present our contribution regarding the representation and the learning of the reward function through analysing interaction traces (i.e. the interaction history between the robot and their users, including users' feedback). Our approach permits to generalise the learned rewards so that when new users are introduced, the robot may quickly adapt using what it learned from previous experiences with other users. We propose, in this article, two algorithms to learn the reward function. The first is direct and certain; the robot applies with a user what it learned during interaction with same kind of users (i.e. users with similar profiles). The second algorithm generalises what it learns to be applied to all kinds of users. Through simulation, we show that the generalised algorithm converges to an optimal reward function with less than half the samples needed by the direct algorithm.

Mots clés

Adaptive and personalised robots Learning from users feedback Markov Decision Processes MDPs

Domaines

Intelligence artificielle [cs.AI]

Fichier principal

Adaptive-Robots-ICTAI2013.pdf (568.08 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Abir B. Karami : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01221445

Soumis le : mercredi 28 octobre 2015-16:10:22

Dernière modification le : mercredi 27 mars 2024-09:18:02

Archivage à long terme le : vendredi 29 janvier 2016-13:12:28

Dates et versions

hal-01221445 , version 1 (28-10-2015)

Identifiants

HAL Id : hal-01221445 , version 1
DOI : 10.1109/ICTAI.2013.98

Citer

Abir-Beatrice Karami, Karim Sehaba, Benoit Encelle. Adaptive and Personalised Robots - Learning from Users' Feedback. IEEE 25th International Conference on Tools with Artificial Intelligence, Nov 2013, Washington DC, United States. ⟨10.1109/ICTAI.2013.98⟩. ⟨hal-01221445⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS UNIV-LYON1 UNIV-LYON2 INSA-LYON EC-LYON LIRIS LABEXIMU INSA-GROUPE UDL

152 Consultations

270 Téléchargements

Adaptive and Personalised Robots - Learning from Users' Feedback

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager