Étude de la motivation intrinsèque en apprentissage par renforcement

Arthur Aubret; Laëtitia Matignon; Salima Hassas

Communication Dans Un Congrès Année : 2019

Étude de la motivation intrinsèque en apprentissage par renforcement

(1) , (1) , (1)

Arthur Aubret

Fonction : Auteur
PersonId : 176995
IdHAL : arthur-aubret
ORCID : 0000-0003-3495-4323

Systèmes Cognitifs et Systèmes Multi-Agents

Laëtitia Matignon

Fonction : Auteur
PersonId : 3290
IdHAL : laetitia-matignon
ORCID : 0000-0001-7126-8715
IdRef : 134644239

Systèmes Cognitifs et Systèmes Multi-Agents

Salima Hassas

Fonction : Auteur
PersonId : 3291
IdHAL : salima-hassas
ORCID : 0000-0002-1387-2866
IdRef : 083298398

Systèmes Cognitifs et Systèmes Multi-Agents

Résumé

Despite many existing works in reinforcement learning (RL) and the recent successes obtained by combining it with deep learning, RL is facing many challenges. Some of them, like the ability to abstract the action or the difficulty to conceive a reward function without expert knowledge, can be addressed by the use of intrinsic motivation. In this article, we provide a survey on the role of intrinsic motivation in RL and its different usages by detailing interests and limits of existing approaches. Our analysis suggests that mutual information is central to most of the work using intrinsic motivation in RL. The combination of deep RL and intrinsic motivation enables to learn more complicated and more generalisable behaviours than what enables standard RL.

Malgré les nombreux travaux existants en apprentissage par renforcement (AR) et les récents succès obtenus notamment en le combinant avec l'apprentissage profond, l'AR fait encore aujourd'hui face à de nombreux défis. Certains d'entre eux, comme la problématique de l'abstraction temporelle des actions ou la difficulté de concevoir une fonction de récompense sans connaissances ex-pertes, peuvent être adressées par l'utilisation de récompenses intrinsèques. Dans cet article, nous proposons une étude du rôle de la motivation intrinsèque en AR et de ses différents usages, en détaillant les intérêts et les limites des approches existantes. Notre analyse suggère que la notion d'information mutuelle est centrale à la plupart des travaux utilisant la motivation intrinsèque en AR. Celle-ci, combinée aux algorithmes d'AR profond, permet d'apprendre des comportements plus complexes et plus généralisables que ce que permet l'AR traditionnel.

Mots clés

Reinforcement learning intrinsic motivation curiosity knowledge acquisition options generation of objectives meta-reward

Apprentissage par renforcement motivation intrinsèque curiosité acquisition de connaissances empowerment options génération d'objectifs

Domaines

Intelligence artificielle [cs.AI]

Fichier principal

JFPDA2019_2.pdf (274.45 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Arthur Aubret : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02272091

Soumis le : mercredi 28 août 2019-13:59:45

Dernière modification le : mercredi 27 mars 2024-09:28:03

Dates et versions

hal-02272091 , version 1 (28-08-2019)

Identifiants

HAL Id : hal-02272091 , version 1

Citer

Arthur Aubret, Laëtitia Matignon, Salima Hassas. Étude de la motivation intrinsèque en apprentissage par renforcement. Journées Francophones sur la Planification, la Décision et l'Apprentissage pour la conduite de systèmes, Jul 2019, Toulouse, France. ⟨hal-02272091⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS UNIV-LYON1 UNIV-LYON2 INSA-LYON EC-LYON LIRIS INSA-GROUPE UDL

523 Consultations

667 Téléchargements

Étude de la motivation intrinsèque en apprentissage par renforcement

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager