A Polynomial Algorithm for Decentralized Markov Decision Processes with Temporal Constraints

Aurélie Beynier; Abdel-Illah Mouaddib

doi:10.1145/1082473.1082619

Communication Dans Un Congrès Année : 2005

A Polynomial Algorithm for Decentralized Markov Decision Processes with Temporal Constraints

(1) , (2)

1
2

Aurélie Beynier

Fonction : Auteur
PersonId : 9272
IdHAL : aurelie-beynier
IdRef : 113330804

Groupe de Recherche en Informatique, Image et Instrumentation de Caen

Abdel-Illah Mouaddib

Fonction : Auteur
PersonId : 930641

Equipe MAD - Laboratoire GREYC - UMR6072

Résumé

One of the difficulties to adapt MDPs for the control of cooperative multi-agent systems, is the complexity issued from Decentralized MDPs. Moreover, existing approaches can not be used for real applications because they do not take into account complex constraints about the execution. In this paper, we present a class of DEC-MDPs, OC-DEC-MDP, that can handle temporal and precedence constraints. This model allows several autonomous agents to cooperate so as to complete a set of tasks without communication. In order to allow the agents to coordinate, we introduce an opportunity cost. Each agent builds its own local MDP independently of the other agents but, it takes into account the lost in value provoked, by its local decision, on the other agents. Existing approaches solving DEC-MDP are NEXP complete or exponential, while our OC-DEC-MDP can be solved by a polynomial algorithm with good approximation.

Mots clés

Multi-agent systems Markov decision Processes Planning Uncertainty

Domaines

Intelligence artificielle [cs.AI] Système multi-agents [cs.MA]

Fichier principal

aamas266.pdf (161.73 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Aurélie Beynier : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01344441

Soumis le : lundi 5 septembre 2016-15:29:40

Dernière modification le : mercredi 20 mars 2024-16:20:04

Archivage à long terme le : mardi 6 décembre 2016-12:05:55

Dates et versions

hal-01344441 , version 1 (05-09-2016)

Identifiants

HAL Id : hal-01344441 , version 1
DOI : 10.1145/1082473.1082619

Citer

Aurélie Beynier, Abdel-Illah Mouaddib. A Polynomial Algorithm for Decentralized Markov Decision Processes with Temporal Constraints. Fourth International joint Conference on Autonomous Agents and Multi Agent Systems (AAMAS'05), 2005, Utrecht, Netherlands. pp.963-969, ⟨10.1145/1082473.1082619⟩. ⟨hal-01344441⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS GREYC GREYC-MAD COMUE-NORMANDIE ENSICAEN UNICAEN

142 Consultations

182 Téléchargements

A Polynomial Algorithm for Decentralized Markov Decision Processes with Temporal Constraints

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager