On Minimizing Ordered Weighted Regrets in Multiobjective Markov Decision Processes

Wlodzimierz Ogryczak; Patrice Perny; Paul Weng

doi:10.1007/978-3-642-24873-3_15

Communication Dans Un Congrès Année : 2011

On Minimizing Ordered Weighted Regrets in Multiobjective Markov Decision Processes

, (1) , (1)

Wlodzimierz Ogryczak

Fonction : Auteur

Patrice Perny

Fonction : Auteur
PersonId : 9264
IdHAL : patrice-perny
IdRef : 11341689X

DECISION

Paul Weng

Fonction : Auteur
PersonId : 952563

DECISION

Résumé

In this paper, we propose an exact solution method to generate fair policies in Multiobjective Markov Decision Processes (MMDPs). MMDPs consider n immediate reward functions, representing either individual payoffs in a multiagent problem or rewards with respect to different objectives. In this context, we focus on the determination of a policy that fairly shares regrets among agents or objectives, the regret being defined on each dimension as the opportunity loss with respect to optimal expected rewards. To this end, we propose to minimize the ordered weighted average of regrets (OWR). The OWR criterion indeed extends the minimax regret, relaxing egalitarianism for a milder notion of fairness. After showing that OWR-optimality is state-dependent and that the Bellman principle does not hold for OWR-optimal policies, we propose a linear programming reformulation of the problem. We also provide experimental results showing the efficiency of our approach.

Domaines

Informatique [cs]

Lip6 Publications : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01285802

Soumis le : mercredi 9 mars 2016-17:05:52

Dernière modification le : mardi 11 avril 2023-15:16:28

Dates et versions

hal-01285802 , version 1 (09-03-2016)

Identifiants

HAL Id : hal-01285802 , version 1
DOI : 10.1007/978-3-642-24873-3_15

Citer

Wlodzimierz Ogryczak, Patrice Perny, Paul Weng. On Minimizing Ordered Weighted Regrets in Multiobjective Markov Decision Processes. 2nd International Conference on Algorithmic Decision Theory (ADT'11), Oct 2011, Piscataway, NJ, United States. pp.190-204, ⟨10.1007/978-3-642-24873-3_15⟩. ⟨hal-01285802⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UPMC CNRS LIP6 SORBONNE-UNIVERSITE SU-SCIENCES

93 Consultations

0 Téléchargements

On Minimizing Ordered Weighted Regrets in Multiobjective Markov Decision Processes

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager