Multi-armed Bandit, Dynamic Environments and Meta-Bandits - HAL open archive
Preprint, Working Paper. Year: 2006

Multi-armed Bandit, Dynamic Environments and Meta-Bandits

Abstract

This paper presents the Adapt-EvE algorithm, which extends the UCBT online learning algorithm (Auer et al. 2002) to abruptly changing environments. Adapt-EvE features an adaptive change-point detection test based on Page-Hinkley statistics, and two alternative extra-exploration procedures, based respectively on smooth restart and Meta-Bandits.
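To make the change-point idea concrete, here is a minimal Python sketch of a textbook Page-Hinkley test that flags a drop in the mean of a reward stream. It is an illustration only, not the Adapt-EvE implementation: the class name `PageHinkley`, the helper `first_alarm`, and the parameters `delta` (tolerance) and `threshold` (alarm level) with their default values are assumptions, and the abstract does not say how the paper adapts the threshold.

```python
class PageHinkley:
    """Textbook Page-Hinkley test for a decrease in the mean of a stream.

    Hypothetical sketch: names and default values are illustrative
    assumptions, not taken from the paper.
    """

    def __init__(self, delta=0.005, threshold=5.0):
        self.delta = delta          # tolerated magnitude of fluctuation
        self.threshold = threshold  # alarm level (often written lambda)
        self.reset()

    def reset(self):
        self.n = 0          # number of observations seen
        self.mean = 0.0     # running mean of the observations
        self.cum = 0.0      # cumulative deviation m_T
        self.cum_max = 0.0  # running maximum M_T of m_t

    def update(self, x):
        """Feed one observation; return True if a change is signalled."""
        self.n += 1
        self.mean += (x - self.mean) / self.n
        self.cum += x - self.mean + self.delta
        self.cum_max = max(self.cum_max, self.cum)
        # Alarm when the cumulative deviation falls far below its maximum.
        return self.cum_max - self.cum > self.threshold


def first_alarm(stream, **kwargs):
    """Return the 0-based index of the first alarm, or None if none occurs."""
    detector = PageHinkley(**kwargs)
    for i, x in enumerate(stream):
        if detector.update(x):
            return i
    return None


if __name__ == "__main__":
    # Rewards drop abruptly from 0.9 to 0.1 at step 200.
    rewards = [0.9] * 200 + [0.1] * 200
    print("first alarm at step:", first_alarm(rewards))
```

In a bandit setting, one plausible use is to feed the detector the rewards collected by the policy and, on an alarm, trigger some form of extra exploration; how Adapt-EvE does this (smooth restart or Meta-Bandits) is described in the paper itself, not in this sketch.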
Main file
MetaEve.pdf (106.66 KB)

Dates and versions

hal-00113668, version 1 (14-11-2006)

Identifiers

  • HAL Id: hal-00113668, version 1

Cite

Cédric Hartland, Sylvain Gelly, Nicolas Baskiotis, Olivier Teytaud, Michèle Sebag. Multi-armed Bandit, Dynamic Environments and Meta-Bandits. 2006. ⟨hal-00113668⟩