X -ARMED BANDITS: OPTIMIZING QUANTILES, CVAR AND OTHER RISKS - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2019

X -ARMED BANDITS: OPTIMIZING QUANTILES, CVAR AND OTHER RISKS

Résumé

We propose and analyze StoROO, an algorithm for risk optimization on stochastic black-box functions derived from StoOO. Motivated by risk-averse decision making fields like agriculture, medicine, biology or finance, we do not focus on the mean payoff but on generic functionals of the return distribution. We provide a generic regret analysis of StoROO and illustrate its applicability with two examples: the optimization of quantiles and CVaR. Inspired by the bandit literature and black-box mean optimizers, StoROO relies on the possibility to construct confidence intervals for the targeted functional based on random-size samples. We detail their construction in the case of quantiles, providing tight bounds based on Kullback-Leibler divergence. We finally present numerical experiments that show a dramatic impact of tight bounds for the optimization of quantiles and CVaR.
Fichier principal
Vignette du fichier
Hal_bandit(1).pdf (1.79 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-02101647 , version 1 (17-04-2019)
hal-02101647 , version 2 (30-10-2019)
hal-02101647 , version 3 (04-03-2020)

Identifiants

  • HAL Id : hal-02101647 , version 3

Citer

Léonard Torossian, Aurélien Garivier, Victor Picheny. X -ARMED BANDITS: OPTIMIZING QUANTILES, CVAR AND OTHER RISKS. Asian Conference on Machine Learning, Nov 2019, Nagoya, Japan. pp.252-267. ⟨hal-02101647v3⟩
159 Consultations
73 Téléchargements

Partager

Gmail Facebook X LinkedIn More