Independent reinforcement learners in cooperative Markov games: a survey regarding coordination problems.

Laëtitia Matignon; Guillaume J. Laurent; Nadine Le Fort-Piat

doi:10.1017/S026988891200057

Journal article. Knowledge Engineering Review. Year: 2012


Laëtitia Matignon

Role: Author
PersonId: 3290
IdHAL: laetitia-matignon
ORCID: 0000-0001-7126-8715
IdRef: 134644239

Franche-Comté Électronique Mécanique, Thermique et Optique - Sciences et Technologies (UMR 6174)

Guillaume J. Laurent

Role: Author
PersonId: 928139

Franche-Comté Électronique Mécanique, Thermique et Optique - Sciences et Technologies (UMR 6174)

Nadine Le Fort-Piat

Role: Author
PersonId: 853953

Franche-Comté Électronique Mécanique, Thermique et Optique - Sciences et Technologies (UMR 6174)

Abstract

In fully cooperative multi-agent systems, independent (non-communicative) agents that learn by reinforcement must overcome several difficulties in order to coordinate. This paper identifies five challenges responsible for the non-coordination of independent agents: Pareto-selection, nonstationarity, stochasticity, alter-exploration and shadowed equilibria. A selection of multi-agent domains is classified according to those challenges: matrix games, Boutilier's coordination game, predator pursuit domains and a special multi-state game. Moreover, the performance of a range of algorithms for independent reinforcement learners is evaluated empirically. Those algorithms are Q-learning variants: decentralized Q-learning, distributed Q-learning, hysteretic Q-learning, recursive FMQ and WoLF-PHC. An overview of the learning algorithms' strengths and weaknesses against each challenge concludes the paper and can serve as a basis for choosing the appropriate algorithm for a new domain. Furthermore, the distilled challenges may assist in the design of new learning algorithms that overcome these problems and achieve higher performance in multi-agent applications.

Domains

Micro and nanotechnologies/Microelectronics

Main file

Matignon2012independent.pdf (6.06 MB)

Origin: Files produced by the author(s)

Contributor: Martine Azema

https://hal.science/hal-00720669

Submitted on: Wednesday, July 25, 2012, 12:59:13

Last modified on: Thursday, April 13, 2023, 09:26:12

Long-term archiving on: Friday, October 26, 2012, 02:45:10

Dates and versions

hal-00720669, version 1 (25-07-2012)

Identifiers

HAL Id: hal-00720669, version 1
DOI: 10.1017/S026988891200057

Cite

Laëtitia Matignon, Guillaume J. Laurent, Nadine Le Fort-Piat. Independent reinforcement learners in cooperative Markov games: a survey regarding coordination problems. Knowledge Engineering Review, 2012, 27 (1), pp.1-31. ⟨10.1017/S026988891200057⟩. ⟨hal-00720669⟩


Collections

CNRS UNIV-FCOMTE UNIV-BM FEMTO-ST UNIV-BM-THESE LABEXIMU

599 views

1926 downloads
