A study of FMQ heuristic in cooperative multi-agent games.

Laëtitia Matignon; Guillaume J. Laurent; Nadine Le Fort - Piat

Communication Dans Un Congrès Année : 2008

A study of FMQ heuristic in cooperative multi-agent games.

(1) , (1) , (1)

Laëtitia Matignon

Fonction : Auteur
PersonId : 3290
IdHAL : laetitia-matignon
ORCID : 0000-0001-7126-8715
IdRef : 134644239

Franche-Comté Électronique Mécanique, Thermique et Optique - Sciences et Technologies (UMR 6174)

Guillaume J. Laurent

Fonction : Auteur
PersonId : 854827

Franche-Comté Électronique Mécanique, Thermique et Optique - Sciences et Technologies (UMR 6174)

Nadine Le Fort - Piat

Fonction : Auteur
PersonId : 853953

Franche-Comté Électronique Mécanique, Thermique et Optique - Sciences et Technologies (UMR 6174)

Résumé

The article focuses on decentralized reinforcement learning (RL) in cooperative multi-agent games, where a team of independent learning agents (ILs) try to coordinate their individual actions to reach an optimal joint action. Within this framework, some algorithms based on Q-learning are proposed in recent works. Especially, we are interested in Distributed Q-learning which finds optimal policies in deterministic games, and in the Frequency Maximum Q value (FMQ) heuristic which is able in partially stochastic matrix games to distinguish if a poor reward received for the same action are due to either miscoordination or to the noisy reward function. Making this distinction is one of the main difficulties to solve stochastic games. Our objective is to find an algorithm able to switch over the updates according to a detection of the cause of noise. In this paper, a modified version of the FMQ heuristic is proposed which achieves this detection and the update adaptation. Moreover, this modified FMQ version is more robust and very easy to set.

Domaines

Micro et nanotechnologies/Microélectronique

Fichier principal

Matignon2008study.pdf (232.03 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Martine Azema : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00342324

Soumis le : jeudi 27 novembre 2008-10:58:15

Dernière modification le : jeudi 13 avril 2023-09:26:11

Archivage à long terme le : lundi 7 juin 2010-20:29:47

Dates et versions

hal-00342324 , version 1 (27-11-2008)

Identifiants

HAL Id : hal-00342324 , version 1

Citer

Laëtitia Matignon, Guillaume J. Laurent, Nadine Le Fort - Piat. A study of FMQ heuristic in cooperative multi-agent games.. The 7th International Conference on Autonomous Agents and Multiagent Systems. Workshop 10 : Multi-Agent Sequential Decision Making in Uncertain Multi-Agent Domains, aamas'08., May 2008, Estoril, Portugal. pp.77-91. ⟨hal-00342324⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS UNIV-FCOMTE UNIV-BM FEMTO-ST UNIV-BM-THESE LABEXIMU

270 Consultations

578 Téléchargements

A study of FMQ heuristic in cooperative multi-agent games.

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager