Open Problem: Risk of Ruin in Multiarmed Bandits - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2019

Open Problem: Risk of Ruin in Multiarmed Bandits

Résumé

We formalize a particular class of problems called survival multiarmed bandits (S-MAB), which constitutes a modified version of budgeted multiarmed bandits (B-MAB) where a true risk of ruin must be considered, bringing it closer to risk-averse multiarmed bandits (RA-MAB). In a S-MAB, pulling an arm can result in both positive and negative rewards. The agent has an initial budget that evolves in time with the received rewards. The goal is finding a good exploration-exploitationsafety trade-off, maximizing rewards while minimizing the probability of getting ruined (i.e. hitting a negative budget). Such simple and until now neglected modification in the MAB statement changes the way to approach the problem, asking for adapted algorithms and specific analytical tools, and also make it more likely related to some important real-world applications. We are interested in the following open problems which stem from such new MAB definition: (a) how can the regret be meaningfully defined in formal terms for a S-MAB given its multiobjective optimization nature? (b) can a S-MAB be reduced to a RA-MAB or a B-MAB, transferring their theoretical guarantees? (c) what kind of method or strategy must an agent follow to optimally solve a S-MAB?
Fichier principal
Vignette du fichier
2019___COLT__CR____Risk_of_Ruin_in_Multiarmed_Bandits.pdf (146.34 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-02363609 , version 1 (17-03-2021)

Identifiants

  • HAL Id : hal-02363609 , version 1

Citer

Filipo Studzinski Perotto, Mathieu Bourgais, Laurent Vercouter, Bruno Castro da Silva. Open Problem: Risk of Ruin in Multiarmed Bandits. Conference on Learning Theory (COLT), Jun 2019, Phoenix, United States. ⟨hal-02363609⟩
44 Consultations
83 Téléchargements

Partager

Gmail Facebook X LinkedIn More