Solving efficiently Decentralized MDPs with temporal and resource constraints

Aurélie Beynier 1 Abdel-Illah Mouaddib 2
1 SMA - Systèmes Multi-Agents
LIP6 - Laboratoire d'Informatique de Paris 6
2 Equipe MAD - Laboratoire GREYC - UMR6072
GREYC - Groupe de Recherche en Informatique, Image, Automatique et Instrumentation de Caen
Abstract : Optimizing the operation of cooperative multi-agent systems that can deal with large and realistic problems has become an important focal area of research in the multi-agent community. In this paper we first present a new model, the OC-DEC-MDP (Opportunity Cost Decentralized Markov Decision Processes), that allows for representing large multi-agent decision problems with temporal and precedence constraints. Then, we propose polynomial algorithms to efficiently solve problems formalized by OC-DEC-MDPs. The problems we deal with consist of a set of agents that have to execute a set of tasks in a cooperative way. The agents cannot communicate during execution and they have to respect some resource and temporal constraints. Our approach is based on Decentralized Markov Decision Processes (DEC-MDPs) and uses a concept of opportunity cost borrowed from economics to obtain approximate control policies. Currently, the best existing techniques can only solve optimally small problems. Experimental results show that our approach produces good quality solutions for complex problems which are out of reach of existing approaches.
Type de document :
Article dans une revue
Autonomous Agents and Multi-Agent Systems, Springer Verlag, 2011, 23 (3), pp.486 - 539. 〈10.1007/s10458-010-9145-2〉
Liste complète des métadonnées

Littérature citée [40 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-01344444
Contributeur : Aurélie Beynier <>
Soumis le : lundi 11 juillet 2016 - 21:44:47
Dernière modification le : mardi 26 septembre 2017 - 01:30:46
Document(s) archivé(s) le : mercredi 12 octobre 2016 - 14:56:03

Fichier

BeynierMouaddibJAAMAS.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Collections

Citation

Aurélie Beynier, Abdel-Illah Mouaddib. Solving efficiently Decentralized MDPs with temporal and resource constraints. Autonomous Agents and Multi-Agent Systems, Springer Verlag, 2011, 23 (3), pp.486 - 539. 〈10.1007/s10458-010-9145-2〉. 〈hal-01344444〉

Partager

Métriques

Consultations de la notice

169

Téléchargements de fichiers

45