Autotelic Reinforcement Learning in Multi-Agent Environments

Eleni Nisioti; Elías Masquil; Gautier Hamon; And Clément Moulin-Frier

Communication Dans Un Congrès Année : 2023

Autotelic Reinforcement Learning in Multi-Agent Environments

(1) , (1) , (1) , (1)

Eleni Nisioti

Fonction : Auteur
PersonId : 750895
IdHAL : eleni-nisioti

Flowing Epigenetic Robots and Systems

Elías Masquil

Fonction : Auteur

Flowing Epigenetic Robots and Systems

Gautier Hamon

Fonction : Auteur
PersonId : 751577
IdHAL : gautier-hamon
ORCID : 0000-0002-4326-7296

Flowing Epigenetic Robots and Systems

And Clément Moulin-Frier

Fonction : Auteur

Flowing Epigenetic Robots and Systems

Résumé

How can a population of reinforcement learning agents autonomously learn a diversity of cooperative tasks in a shared environment? In the single-agent paradigm, goal-conditioned policies have been combined with intrinsic motivation mechanisms to endow agents with the ability to master a wide diversity of autonomously discovered goals. Transferring this idea to cooperative multi-agent systems (MAS) entails a challenge: intrinsically motivated agents that sample goals independently focus on a shared cooperative goal with low probability, impairing their learning performance. In this work, we propose a new learning paradigm for modeling such settings, the Decentralized Intrinsically Motivated Skill Acquisition Problem (Dec-IMSAP), and employ it to solve cooperative navigation tasks. Agents in a Dec-IMSAP are trained in a fully decentralized way, which comes in contrast to previous contributions in multi-goal MAS that consider a centralized goal-selection mechanism. Our empirical analysis indicates that a sufficient condition for efficiently learning a diversity of cooperative tasks is to ensure that a group aligns its goals, i.e., the agents pursue the same cooperative goal and learn to coordinate their actions through specialization. We introduce the Goal-coordination game, a fully-decentralized emergent communication algorithm, where goal alignment emerges from the maximization of individual rewards in multi-goal cooperative environments and show that it is able to reach equal performance to a centralized training baseline that guarantees aligned goals. To our knowledge, this is the first contribution addressing the problem of intrinsically motivated multi-agent goal exploration in a decentralized training paradigm.

Mots clés

Multi-agent Learning Goal-conditioned Learning Intrinsic Moti- vation Reinforcement Learning Emergent Communication

Domaines

Informatique [cs]

Eleni Nisioti : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03898121

Soumis le : mercredi 14 décembre 2022-11:46:25

Dernière modification le : vendredi 19 janvier 2024-14:43:39

Dates et versions

hal-03898121 , version 1 (14-12-2022)

Licence

Paternité

Identifiants

HAL Id : hal-03898121 , version 1
ARXIV : 2211.06082

Citer

Eleni Nisioti, Elías Masquil, Gautier Hamon, And Clément Moulin-Frier. Autotelic Reinforcement Learning in Multi-Agent Environments. CoLLAs 2023, Conference on Lifelong Learning Agents, Aug 2023, Montréal, Canada. ⟨hal-03898121⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INRIA INRIA2 ANR

55 Consultations

0 Téléchargements

Autotelic Reinforcement Learning in Multi-Agent Environments

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager