Algebraic Markov Decision Processes

Patrice Perny; Olivier Spanjaard; Paul Weng

Conference paper, Year: 2005


Patrice Perny

Role: Author
PersonId: 9264
IdHAL: patrice-perny
IdRef: 11341689X

DECISION

Olivier Spanjaard

Role: Author
PersonId: 14601
IdHAL: olivier-spanjaard
ORCID: 0000-0002-9948-090X
IdRef: 081158882

DECISION

Paul Weng

Role: Author
PersonId: 952563

DECISION

Abstract

In this paper, we provide an algebraic approach to Markov Decision Processes (MDPs), which allows a unified treatment of MDPs and includes many existing models (quantitative or qualitative) as particular cases. In algebraic MDPs, rewards are expressed in a semiring structure, uncertainty is represented by a decomposable plausibility measure valued on a second semiring structure, and preferences over policies are represented by Generalized Expected Utility. We recast the problem of finding an optimal policy at a finite horizon as an algebraic path problem in a decision rule graph where arcs are valued by functions, which justifies the use of the Jacobi algorithm to solve algebraic Bellman equations. In order to show the potential of this general approach, we exhibit new variations of MDPs, admitting complete or partial preference structures, as well as probabilistic or possibilistic representation of uncertainty.
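To make the idea of one recursion serving several models concrete, here is a minimal Python sketch of finite-horizon backward induction parameterized by a bundle of operators. It is an illustration under stated assumptions, not the authors' formulation: the `Algebra` class, its field names, the toy two-state problem, and the two instantiations are all hypothetical, and the recursion is plain backward induction rather than the Jacobi algorithm the abstract refers to. It also assumes a complete preference order, so the partial preference structures mentioned in the abstract are out of its scope.

```python
from dataclasses import dataclass
from operator import add, gt, mul
from typing import Callable, Dict, List, Tuple


@dataclass(frozen=True)
class Algebra:
    """Hypothetical operator bundle; names are illustrative, not from the paper."""
    combine: Callable[[float, float], float]     # aggregates over successor states
    zero: float                                  # identity element of `combine`
    scale: Callable[[float, float], float]       # weights a value by a plausibility
    accumulate: Callable[[float, float], float]  # merges reward with future value
    better: Callable[[float, float], bool]       # strict preference (complete order)


# Classical expected total reward: sum of probability * value, additive rewards.
PROBABILISTIC = Algebra(combine=add, zero=0.0, scale=mul, accumulate=add, better=gt)
# An optimistic max-min variant (assumption): degrees and utilities lie in [0, 1],
# and terminal values should be 1.0, the identity of `min`.
POSSIBILISTIC = Algebra(combine=max, zero=0.0, scale=min, accumulate=min, better=gt)

Key = Tuple[str, str]


def backward_induction(
    states: List[str],
    actions: List[str],
    trans: Dict[Key, Dict[str, float]],
    reward: Dict[Key, float],
    terminal: Dict[str, float],
    horizon: int,
    alg: Algebra,
) -> Tuple[Dict[str, float], List[Dict[str, str]]]:
    """Return (values at the first step, one decision rule per step)."""
    v = dict(terminal)
    policy: List[Dict[str, str]] = []
    for _ in range(horizon):
        new_v: Dict[str, float] = {}
        rule: Dict[str, str] = {}
        for s in states:
            best_a, best_q = None, None
            for a in actions:
                # Generalized expectation over the successors of (s, a).
                expected = alg.zero
                for s2, degree in trans[(s, a)].items():
                    expected = alg.combine(expected, alg.scale(degree, v[s2]))
                q = alg.accumulate(reward[(s, a)], expected)
                if best_a is None or alg.better(q, best_q):
                    best_a, best_q = a, q
            new_v[s], rule[s] = best_q, best_a
        v = new_v
        policy.append(rule)
    policy.reverse()  # rules were built from the last step back to the first
    return v, policy


# Toy problem (made up for illustration).
states = ["s0", "s1"]
actions = ["safe", "risky"]
trans = {
    ("s0", "safe"): {"s0": 1.0},
    ("s0", "risky"): {"s0": 0.5, "s1": 0.5},
    ("s1", "safe"): {"s1": 1.0},
    ("s1", "risky"): {"s1": 1.0},
}
reward = {("s0", "safe"): 1.0, ("s0", "risky"): 0.0,
          ("s1", "safe"): 4.0, ("s1", "risky"): 0.0}
terminal = {"s0": 0.0, "s1": 0.0}

v, policy = backward_induction(states, actions, trans, reward, terminal, 2, PROBABILISTIC)
print(v)       # {'s0': 2.5, 's1': 8.0}
print(policy)  # first-step rule picks "risky" in s0, then "safe" everywhere
```

Swapping `PROBABILISTIC` for `POSSIBILISTIC` reuses the same loop with max-min operators, provided the inputs are rescaled to [0, 1]; handling partially ordered values would require keeping a set of non-dominated candidates instead of a single `best_q`.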

Domains

Computer Science [cs]

Main file

pub_359_1_1677.pdf (191.63 KB)

Origin: Files produced by the author(s)


https://hal.science/hal-01492606

Submitted on: Monday, 10 July 2017, 18:11:02

Last modified on: Tuesday, 11 April 2023, 15:16:28

Long-term archiving on: Wednesday, 24 January 2018, 18:40:09

Dates and versions

hal-01492606, version 1 (10-07-2017)

Identifiers

HAL Id: hal-01492606, version 1

Cite

Patrice Perny, Olivier Spanjaard, Paul Weng. Algebraic Markov Decision Processes. 19th International Joint Conference on Artificial Intelligence, Jul 2005, Edinburgh, United Kingdom. pp.1372-1377. ⟨hal-01492606⟩


Collections

UPMC CNRS LIP6 SORBONNE-UNIVERSITE SU-SCIENCES

