Sequential Decision-Making under Non-stationary Environments via Sequential Change-point Detection

Emmanuel Hadoux; Aurélie Beynier; Paul Weng

Conference paper, Year: 2014


Emmanuel Hadoux

Role: Author
PersonId : 6716
IdHAL : emmanuel-hadoux
IdRef : 192282492

Systèmes Multi-Agents

Aurélie Beynier

Role: Author
PersonId : 9272
IdHAL : aurelie-beynier
IdRef : 113330804

Systèmes Multi-Agents

Paul Weng

Role: Author

DECISION

Abstract

Reinforcement Learning (RL) has mainly focused on computing an optimal policy for an agent acting in a stationary environment. However, in many real-world decision problems, the stationarity assumption does not hold. One can view a non-stationary environment as a set of contexts (also called modes or modules), where each context corresponds to one possible stationary dynamics of the environment. Although most approaches assume that the number of modes is known, an RL method, Reinforcement Learning with Context Detection (RLCD), has recently been proposed to learn an a priori unknown set of contexts and detect context changes. In this paper, we propose a new approach that adapts tools developed in statistics, and more precisely in sequential analysis, for detecting an environmental change. Our approach is thus more theoretically founded and requires fewer parameters than RLCD. We also show that our parameters are easier to interpret and therefore easier to tune. Finally, we show experimentally that our approach outperforms the current methods on several application problems.
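
The abstract does not say which sequential test the authors use. As a general illustration of sequential change-point detection, the sketch below applies a CUSUM-style test to a stream of observed outcomes, accumulating the log-likelihood ratio between a hypothetical "new" context model and the "old" one and raising an alarm when the statistic crosses a threshold. The function name, the two probability tables, and the threshold value are all assumptions made for this example and are not taken from the paper.

```python
import math

def cusum_change_point(observations, p_old, p_new, threshold):
    """Return the index at which the CUSUM statistic first exceeds
    `threshold`, or None if no change is detected.

    p_old and p_new map each possible observation to its probability
    under the old and the new context (hypothetical models).
    """
    stat = 0.0
    for t, x in enumerate(observations):
        # Log-likelihood ratio of this observation: new context vs. old one.
        llr = math.log(p_new[x] / p_old[x])
        # Reset to zero whenever the evidence favours the old context.
        stat = max(0.0, stat + llr)
        if stat > threshold:
            return t
    return None

# Hypothetical outcome distributions for two contexts.
p_old = {"s0": 0.9, "s1": 0.1}
p_new = {"s0": 0.2, "s1": 0.8}

# The environment switches context after 20 steps.
obs = ["s0"] * 20 + ["s1"] * 10
alarm = cusum_change_point(obs, p_old, p_new, threshold=5.0)
print(alarm)
```

In a sketch like this, the threshold is the single tuning parameter: raising it lowers the false-alarm rate at the cost of a longer detection delay.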

Domains

Artificial Intelligence [cs.AI], Machine Learning [cs.LG]

Main file

LMCE14.pdf (268.54 KB)

Origin: Files produced by the author(s)


https://hal.science/hal-01200817

Submitted on: Thursday, September 17, 2015, 15:43:01

Last modified on: Tuesday, April 11, 2023, 15:16:28

Long-term archiving on: Tuesday, December 29, 2015, 07:43:25

Dates and versions

hal-01200817 , version 1 (17-09-2015)

Identifiers

HAL Id : hal-01200817 , version 1

Cite

Emmanuel Hadoux, Aurélie Beynier, Paul Weng. Sequential Decision-Making under Non-stationary Environments via Sequential Change-point Detection. Learning over Multiple Contexts (LMCE), Sep 2014, Nancy, France. ⟨hal-01200817⟩


Collections

UPMC CNRS LIP6 SORBONNE-UNIVERSITE SU-SCIENCES ANR

575 views

794 downloads
