Reducing the Number of Queries in Interactive Value Iteration

Hugo Gilbert; Olivier Spanjaard; Paolo Viappiani; Paul Weng

doi:10.1007/978-3-319-23114-3_9

Communication Dans Un Congrès Année : 2015

Reducing the Number of Queries in Interactive Value Iteration

(1) , (1) , (1) , (2, 3)

1
2
3

Hugo Gilbert

Fonction : Auteur
PersonId : 971449

DECISION

Olivier Spanjaard

Fonction : Auteur
PersonId : 14601
IdHAL : olivier-spanjaard
ORCID : 0000-0002-9948-090X
IdRef : 081158882

DECISION

Paolo Viappiani

Fonction : Auteur
PersonId : 9572
IdHAL : paolo-viappiani
ORCID : 0000-0002-7922-3877
IdRef : 178446521

DECISION

Paul Weng

Fonction : Auteur

SYSU-CMU Joint Institute of Engineering

SYSU-CMU Shunde International Joint Research Institute

Résumé

To tackle the potentially hard task of defining the reward function in a Markov Decision Process (MDPs), a new approach, called Interactive Value Iteration (IVI) has recently been proposed by Weng and Zanuttini (2013). This solving method, which interweaves elicitation and optimization phases, computes a (near) optimal policy without knowing the precise reward values. The procedure as originally presented can be improved in order to reduce the number of queries needed to determine an optimal policy. The key insights are that (1) asking queries should be delayed as much as possible, avoiding asking queries that might not be necessary to determine the best policy, (2) queries should be asked by following a priority order because the answers to some queries can enable to resolve some other queries, (3) queries can be avoided by using heuristic information to guide the process. Following these ideas, a modified IVI algorithm is presented and experimental results show a significant decrease in the number of queries issued.

Domaines

Informatique [cs]

Fichier principal

IEIVI.pdf (347.78 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Lip6 Publications : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01213280

Soumis le : vendredi 30 juin 2017-18:35:13

Dernière modification le : mardi 11 avril 2023-15:16:28

Archivage à long terme le : lundi 22 janvier 2018-22:16:44

Dates et versions

hal-01213280 , version 1 (30-06-2017)

Identifiants

HAL Id : hal-01213280 , version 1
DOI : 10.1007/978-3-319-23114-3_9

Citer

Hugo Gilbert, Olivier Spanjaard, Paolo Viappiani, Paul Weng. Reducing the Number of Queries in Interactive Value Iteration. 4th International Conference on Algorithmic Decision Theory (ADT 2015), Sep 2015, Lexington, KY, United States. pp.139-152, ⟨10.1007/978-3-319-23114-3_9⟩. ⟨hal-01213280⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UPMC CNRS LIP6 SORBONNE-UNIVERSITE SU-SCIENCES

74 Consultations

105 Téléchargements

Reducing the Number of Queries in Interactive Value Iteration

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager