A study of FMQ heuristic in cooperative multi-agent games. - HAL open archive
Conference paper, Year: 2008

A study of FMQ heuristic in cooperative multi-agent games.

Abstract

This article focuses on decentralized reinforcement learning (RL) in cooperative multi-agent games, where a team of independent learning agents (ILs) tries to coordinate their individual actions to reach an optimal joint action. Within this framework, several algorithms based on Q-learning have been proposed in recent works. In particular, we are interested in Distributed Q-learning, which finds optimal policies in deterministic games, and in the Frequency Maximum Q value (FMQ) heuristic, which is able, in partially stochastic matrix games, to distinguish whether a poor reward received for the same action is due to miscoordination or to the noisy reward function. Making this distinction is one of the main difficulties in solving stochastic games. Our objective is to find an algorithm able to switch between the updates according to the detected cause of the noise. In this paper, a modified version of the FMQ heuristic is proposed that achieves this detection and the update adaptation. Moreover, this modified FMQ version is more robust and very easy to tune.
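To make the two baseline update rules named in the abstract concrete, here is a minimal Python sketch for a single independent learner in a repeated matrix game. It follows the commonly described forms of Distributed Q-learning (an optimistic max update) and of the original FMQ heuristic (a Q-value boosted by how often the best reward was observed). It is not the modified FMQ version proposed in the paper, whose details the abstract does not give. The class names, the parameters `alpha`, `c`, and `temperature`, and the `-inf` initialization are assumptions made for illustration.

```python
import math
import random


class DistributedQAgent:
    """Optimistic learner: keeps the best reward ever seen for each action."""

    def __init__(self, n_actions, q_init=float("-inf")):
        # Assumption: start below any possible reward so the first sample wins.
        self.q = [q_init] * n_actions

    def update(self, action, reward):
        # Low rewards (miscoordination or noise) are ignored by the max.
        self.q[action] = max(self.q[action], reward)


class FMQAgent:
    """Sketch of the FMQ heuristic for one agent (hypothetical naming)."""

    def __init__(self, n_actions, alpha=0.1, c=10.0):
        self.alpha = alpha                       # learning rate (assumed value)
        self.c = c                               # weight of the frequency term
        self.q = [0.0] * n_actions               # ordinary Q-values
        self.r_max = [float("-inf")] * n_actions # best reward seen per action
        self.count = [0] * n_actions             # times each action was played
        self.count_max = [0] * n_actions         # times r_max was received

    def update(self, action, reward):
        self.count[action] += 1
        if reward > self.r_max[action]:
            # New best reward: restart its occurrence counter.
            self.r_max[action] = reward
            self.count_max[action] = 1
        elif reward == self.r_max[action]:
            self.count_max[action] += 1
        # Standard stateless Q-learning update.
        self.q[action] += self.alpha * (reward - self.q[action])

    def evaluation(self, action):
        # Q-value plus c * frequency(r_max) * r_max; plain Q if never played.
        if self.count[action] == 0:
            return self.q[action]
        freq = self.count_max[action] / self.count[action]
        return self.q[action] + self.c * freq * self.r_max[action]

    def select_action(self, temperature=1.0):
        # Boltzmann exploration over the evaluations (shifted for stability).
        values = [self.evaluation(a) for a in range(len(self.q))]
        top = max(values)
        weights = [math.exp((v - top) / temperature) for v in values]
        return random.choices(range(len(values)), weights=weights)[0]
```

The contrast is the point of the sketch: the max update never lowers an estimate, which suits deterministic games but overestimates under noisy rewards, while the frequency term lets a learner discount a high reward that is only rarely obtained.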
Main file
Matignon2008study.pdf (232.03 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-00342324, version 1 (27-11-2008)

Identifiers

  • HAL Id: hal-00342324, version 1

Cite

Laëtitia Matignon, Guillaume J. Laurent, Nadine Le Fort-Piat. A study of FMQ heuristic in cooperative multi-agent games. The 7th International Conference on Autonomous Agents and Multiagent Systems. Workshop 10: Multi-Agent Sequential Decision Making in Uncertain Multi-Agent Domains, AAMAS'08, May 2008, Estoril, Portugal. pp.77-91. ⟨hal-00342324⟩
