Polynomial Cost of Adaptation for X -Armed Bandits - Archive ouverte HAL Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2019

Polynomial Cost of Adaptation for X -Armed Bandits

Coût de l'adaptation pour les bandits à continuum de bras

Résumé

In the context of stochastic continuum-armed bandits, we present an algorithm that adapts to the unknown smoothness of the objective function. We exhibit and compute a polynomial cost of adaptation to the Hölder regularity for regret minimization. To do this, we first reconsider the recent lower bound of Locatelli and Carpentier [20], and define and characterize admissible rate functions. Our new algorithm matches any of these minimal rate functions. We provide a finite-time analysis and a thorough discussion about asymptotic optimality.
Fichier principal
Vignette du fichier
arx_bandit_adaptation.pdf (715.48 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02138492 , version 1 (23-05-2019)
hal-02138492 , version 2 (07-12-2019)

Identifiants

Citer

Hédi Hadiji. Polynomial Cost of Adaptation for X -Armed Bandits. 2019. ⟨hal-02138492v1⟩
66 Consultations
40 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More