A Polynomial Algorithm for Decentralized Markov Decision Processes with Temporal Constraints - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2005

A Polynomial Algorithm for Decentralized Markov Decision Processes with Temporal Constraints

Résumé

One of the difficulties to adapt MDPs for the control of cooperative multi-agent systems, is the complexity issued from Decentralized MDPs. Moreover, existing approaches can not be used for real applications because they do not take into account complex constraints about the execution. In this paper, we present a class of DEC-MDPs, OC-DEC-MDP, that can handle temporal and precedence constraints. This model allows several autonomous agents to cooperate so as to complete a set of tasks without communication. In order to allow the agents to coordinate, we introduce an opportunity cost. Each agent builds its own local MDP independently of the other agents but, it takes into account the lost in value provoked, by its local decision, on the other agents. Existing approaches solving DEC-MDP are NEXP complete or exponential, while our OC-DEC-MDP can be solved by a polynomial algorithm with good approximation.
Fichier principal
Vignette du fichier
aamas266.pdf (161.73 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01344441 , version 1 (05-09-2016)

Identifiants

Citer

Aurélie Beynier, Abdel-Illah Mouaddib. A Polynomial Algorithm for Decentralized Markov Decision Processes with Temporal Constraints. Fourth International joint Conference on Autonomous Agents and Multi Agent Systems (AAMAS'05), 2005, Utrecht, Netherlands. pp.963-969, ⟨10.1145/1082473.1082619⟩. ⟨hal-01344441⟩
142 Consultations
182 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More