Stationary Mixing Bandits

Julien Audiffren; Liva Ralaivola

Preprint, Working Paper. Year: 2014

Julien Audiffren

Role: Author
PersonId : 7257
IdHAL : julien-audiffren
ORCID : 0000-0003-4321-2575
IdRef : 160139732

Centre de Mathématiques et de Leurs Applications

Liva Ralaivola

Role: Author
PersonId : 5004
IdHAL : livaralaivola
ORCID : 0000-0002-4571-1119
IdRef : 089319060

éQuipe AppRentissage et MultimediA [Marseille]

Abstract

We study the bandit problem where arms are associated with stationary phi-mixing processes and where rewards are therefore dependent: the question that arises from this setting is that of recovering some independence by ignoring the value of some rewards. As we shall see, the bandit problem we tackle requires us to address the exploration/exploitation/independence trade-off. To do so, we provide a UCB strategy together with a general regret analysis for the case where the size of the independence blocks (the ignored rewards) is fixed and we go a step beyond by providing an algorithm that is able to compute the size of the independence blocks from the data. Finally, we give an analysis of our bandit problem in the restless case, i.e., in the situation where the time counters for all mixing processes simultaneously evolve.
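The idea of "recovering some independence by ignoring the value of some rewards" with a fixed block size can be illustrated with a minimal sketch. This is not the authors' algorithm, which is not reproduced on this page: it assumes the standard UCB1 index, and it assumes that each selected arm is played for `block_size + 1` consecutive steps with only the last reward of the block recorded. The names `blocked_ucb`, `make_sticky_arms`, `block_size`, and `stay` are hypothetical, and the "sticky" Bernoulli arms are only a toy stand-in for a dependent reward process whose state evolves when the arm is pulled (so this is not the restless case).

```python
import math
import random


def make_sticky_arms(probs, stay, rng):
    """Toy dependent arms (an assumption for illustration only).

    Each arm repeats its previous reward with probability `stay`,
    otherwise it draws a fresh Bernoulli(probs[arm]) reward.
    """
    state = [1.0 if rng.random() < p else 0.0 for p in probs]

    def pull(arm):
        if rng.random() >= stay:
            state[arm] = 1.0 if rng.random() < probs[arm] else 0.0
        return state[arm]

    return pull


def blocked_ucb(pull, n_arms, horizon, block_size):
    """UCB1 that records one reward per block of (block_size + 1) pulls."""
    counts = [0] * n_arms   # number of recorded (non-ignored) rewards per arm
    means = [0.0] * n_arms  # empirical mean of the recorded rewards per arm
    history = []            # arm played at every time step, ignored or not
    t = 0
    while t < horizon:
        if 0 in counts:
            # Initialisation: play every arm for one full block first.
            arm = counts.index(0)
        else:
            total = sum(counts)
            arm = max(
                range(n_arms),
                key=lambda a: means[a]
                + math.sqrt(2.0 * math.log(total) / counts[a]),
            )
        reward = None
        for _ in range(block_size + 1):
            if t >= horizon:
                break  # truncated block: nothing is recorded
            reward = pull(arm)  # every pull costs a time step
            history.append(arm)
            t += 1
        else:
            # Full block completed: keep only the last reward of the block.
            counts[arm] += 1
            means[arm] += (reward - means[arm]) / counts[arm]
    return counts, means, history
```

A possible call is `blocked_ucb(make_sticky_arms([0.2, 0.8], 0.5, random.Random(0)), 2, 3000, 2)`. A larger `block_size` weakens the dependence between recorded rewards but leaves fewer of them within the same horizon, which is one way to read the exploration/exploitation/independence trade-off mentioned in the abstract.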

Domains

Machine Learning [cs.LG]

Main file

mixingbandit-ARXIV.pdf (188.79 KB)

Origin: Files produced by the author(s)

https://hal.science/hal-01011112

Submitted on: Monday, June 23, 2014, 09:46:56

Last modified on: Saturday, April 27, 2024, 03:14:16

Long-term archiving on: Tuesday, September 23, 2014, 10:46:27

Dates and versions

hal-01011112 , version 1 (23-06-2014)

Identifiers

HAL Id: hal-01011112, version 1
arXiv: 1406.6020

Cite

Julien Audiffren, Liva Ralaivola. Stationary Mixing Bandits. 2014. ⟨hal-01011112⟩

Collections

LIF CNRS UNIV-AMU ENS-CACHAN EC-MARSEILLE INSMI LIS-LAB ENS-PARIS-SACLAY

201 Views

52 Downloads
