Producing efficient error-bounded solutions for transition independent decentralized MDPs - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2013

Producing efficient error-bounded solutions for transition independent decentralized MDPs

Résumé

There has been substantial progress on algorithms for single-agent sequential decision making problems represented as partially observable Markov decision processes (POMDPs). A number of efficient algorithms for solving POMDPs share two desirable properties: error-bounds and fast convergence rates. Despite significant efforts, no algorithms for solving decentralized POMDPs benefit from these properties, leading to either poor solution quality or limited scalability. This paper presents the first approach for solving transition independent decentralized Markov decision processes (MDPs), that inherits these properties. Two related algorithms illustrate this approach. The first recasts the original problem as a finite-horizon deterministic and completely observable Markov decision process. In this form, the original problem is solved by combining heuristic search with constraint optimization to quickly converge into a near-optimal policy. This algorithm also provides the foundation for the first algorithm for solving infinite-horizon transition independent decentralized MDPs. We demonstrate that both methods outperform state-of-the-art algorithms by multiple orders of magnitude, and for infinite-horizon decentralized MDPs, the algorithm is able to construct more concise policies by searching cyclic policy graphs.
Fichier principal
Vignette du fichier
fp721-Dibangoye.pdf (217.22 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00918066 , version 1 (12-12-2013)

Identifiants

  • HAL Id : hal-00918066 , version 1

Citer

Jilles Steeve Dibangoye, Christopher Amato, Arnaud Doniec, François Charpillet. Producing efficient error-bounded solutions for transition independent decentralized MDPs. International conference on Autonomous Agents and Multi-Agent Systems, May 2013, Saint Paul, MN, United States. pp.539-546. ⟨hal-00918066⟩
254 Consultations
169 Téléchargements

Partager

Gmail Facebook X LinkedIn More