Independent reinforcement learners in cooperative Markov games: a survey regarding coordination problems. - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Knowledge Engineering Review Année : 2012

Independent reinforcement learners in cooperative Markov games: a survey regarding coordination problems.

Résumé

In the framework of fully cooperative multi-agent systems, independent (non-communicative) agents that learn by reinforcement must overcome several difficulties to manage to coordinate. This paper identifies several challenges responsible for the non-coordination of independent agents: Pareto-selection, nonstationarity, stochasticity, alter-exploration and shadowed equilibria. A selection of multi-agent domains is classified according to those challenges: matrix games, Boutilier's coordination game, predators pursuit domains and a special multi-state game. Moreover the performance of a range of algorithms for independent reinforcement learners is evaluated empirically. Those algorithms are Q-learning variants: decentralized Q-learning, distributed Q-learning, hysteretic Q-learning, recursive FMQ and WoLF PHC. An overview of the learning algorithms' strengths and weaknesses against each challenge concludes the paper and can serve as a basis for choosing the appropriate algorithm for a new domain. Furthermore, the distilled challenges may assist in the design of new learning algorithms that overcome these problems and achieve higher performance in multi-agent applications.
Fichier principal
Vignette du fichier
Matignon2012independent.pdf (6.06 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00720669 , version 1 (25-07-2012)

Identifiants

Citer

Laëtitia Matignon, Guillaume J. Laurent, Nadine Le Fort-Piat. Independent reinforcement learners in cooperative Markov games: a survey regarding coordination problems.. Knowledge Engineering Review, 2012, 27 (1), pp.1-31. ⟨10.1017/S026988891200057⟩. ⟨hal-00720669⟩
599 Consultations
1926 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More