Policy Iteration Algorithms for DEC-POMDPs with discounted rewards
Abstract
Over the past seven years, researchers have sought algorithms for the decentralized control of multiple agents under uncertainty. Unfortunately, most standard methods are unable to scale to real-world-sized domains. In this paper, we present promising new theoretical insights for building scalable algorithms with provable error bounds. In light of these insights, this research revisits the policy iteration algorithm for the decentralized partially observable Markov decision process (DEC-POMDP). We derive and analyze the first point-based policy iteration algorithms with provable error bounds. Our experimental results show that we are able to successfully solve all tested DEC-POMDP benchmarks, outperforming standard algorithms in both solution time and policy quality.
Origin: Files produced by the author(s)