Journal articles

A Compromise Programming Approach to Multiobjective Markov Decision Processes

Wlodzimierz Ogryczak, Patrice Perny 1, Paul Weng 1
1 DECISION
LIP6 - Laboratoire d'Informatique de Paris 6
Abstract : A Markov decision process (MDP) is a general model for solving planning problems under uncertainty. It has been extended to multiobjective MDPs to address multicriteria or multiagent problems in which the value of a decision must be evaluated from several, sometimes conflicting, viewpoints. Although most studies concentrate on determining the set of Pareto-optimal policies, we focus here on a more specialized problem: the direct determination of policies that achieve well-balanced tradeoffs. To this end, we introduce a reference point method based on the optimization of a weighted ordered weighted average (WOWA) of individual disachievements. We show that the resulting notion of optimal policy does not satisfy the Bellman principle and depends on the initial state. To overcome these difficulties, we propose a solution method based on a linear programming (LP) reformulation of the problem. Finally, we illustrate the feasibility of the proposed method on two types of planning problems under uncertainty, arising in the navigation of an autonomous agent and in inventory management.
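To make the WOWA aggregation mentioned in the abstract more concrete, here is a minimal Python sketch of the standard WOWA operator with piecewise-linear interpolation of the cumulative preference weights. It is an illustration under stated assumptions, not the authors' implementation: the function name `wowa`, the argument names, and the convention that larger disachievements are worse (so values are sorted in decreasing order) are all assumptions, and the exact formulation used in the paper may differ.

```python
from typing import Sequence


def wowa(values: Sequence[float],
         importance: Sequence[float],
         preference: Sequence[float]) -> float:
    """Weighted OWA of `values` (hypothetical helper, not from the paper).

    importance: one weight per criterion, summing to 1.
    preference: one weight per rank (largest value first), summing to 1.
    """
    n = len(values)
    if not (len(importance) == len(preference) == n):
        raise ValueError("all three sequences must have the same length")

    # Cumulative preference weights define w*(k/n) for k = 0..n.
    cum_pref = [0.0]
    for w in preference:
        cum_pref.append(cum_pref[-1] + w)

    def w_star(t: float) -> float:
        # Piecewise-linear interpolation through the points (k/n, cum_pref[k]).
        t = min(max(t, 0.0), 1.0)
        pos = t * n
        k = min(int(pos), n - 1)
        return cum_pref[k] + (pos - k) * (cum_pref[k + 1] - cum_pref[k])

    # Visit criteria from the largest value to the smallest.
    order = sorted(range(n), key=lambda i: values[i], reverse=True)
    total, acc = 0.0, 0.0
    for i in order:
        nxt = acc + importance[i]
        # The weight of this value is the increase of w* over its importance slice.
        total += (w_star(nxt) - w_star(acc)) * values[i]
        acc = nxt
    return total
```

With equal importance weights the function reduces to an ordinary OWA, and with equal preference weights it reduces to a weighted mean, which gives two simple sanity checks for any implementation.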

https://hal.archives-ouvertes.fr/hal-01170494
Contributor : Lip6 Publications
Submitted on : Wednesday, July 1, 2015 - 4:11:27 PM
Last modification on : Thursday, March 21, 2019 - 1:10:43 PM

Citation

Wlodzimierz Ogryczak, Patrice Perny, Paul Weng. A Compromise Programming Approach to Multiobjective Markov Decision Processes. International Journal of Information Technology and Decision Making, World Scientific Publishing, 2013, 12 (5), pp.1021-1053. ⟨10.1142/S0219622013400075⟩. ⟨hal-01170494⟩
