Sparse Stochastic Bandits - Archive ouverte HAL
Conference Paper, Year: 2017

Sparse Stochastic Bandits

Abstract

In the classical multi-armed bandit problem, d arms are available to the decision maker, who pulls them sequentially in order to maximize their cumulative reward. Guarantees can be obtained on a relative quantity called regret, which scales linearly with d (or with sqrt(d) in the minimax sense). We consider here the sparse variant of this classical problem, in which only a small number of arms, namely s < d, have a positive expected reward. We leverage this additional assumption to provide an algorithm whose regret scales with s instead of d. Moreover, we prove that this algorithm is optimal by providing a matching lower bound - at least for a wide and pertinent range of parameters that we determine - and by evaluating its performance on simulated data.
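The abstract does not describe the paper's actual algorithm, but the sparsity assumption (all but s arms have non-positive expected reward) can be illustrated with a simple heuristic sketch: run a standard UCB index policy, and permanently discard any arm whose upper confidence bound drops below zero, since such an arm is confidently outside the sparse support. The function names, the Gaussian reward model, and the elimination rule below are illustrative assumptions, not the method from the paper.

```python
import numpy as np

def sparse_ucb_sketch(means, horizon, seed=0, conf=2.0):
    """Illustrative heuristic (NOT the paper's algorithm): UCB with
    elimination of arms whose upper confidence bound falls below 0,
    exploiting the assumption that non-sparse arms have mean <= 0.
    Rewards are simulated as Gaussian with unit variance."""
    rng = np.random.default_rng(seed)
    d = len(means)
    counts = np.zeros(d)          # number of pulls per arm
    sums = np.zeros(d)            # cumulative reward per arm
    active = list(range(d))       # arms not yet eliminated
    total = 0.0
    for t in range(1, horizon + 1):
        unexplored = [a for a in active if counts[a] == 0]
        if unexplored:
            # Pull each active arm once before trusting its estimate.
            arm = unexplored[0]
        else:
            ucb = {a: sums[a] / counts[a]
                      + np.sqrt(conf * np.log(t) / counts[a])
                   for a in active}
            # Eliminate arms that are confidently non-positive
            # (keep the old set if everything would be dropped).
            active = [a for a in active if ucb[a] > 0] or active
            arm = max(active, key=lambda a: ucb[a])
        reward = rng.normal(means[arm], 1.0)
        counts[arm] += 1
        sums[arm] += reward
        total += reward
    return total, counts

# Example: d = 4 arms, only s = 1 has positive mean.
total, counts = sparse_ucb_sketch([0.5, 0.0, -0.1, -0.3], horizon=2000)
```

With this elimination rule, arms outside the sparse support stop being pulled once their confidence interval lies below zero, so exploration cost concentrates on the s positive arms rather than all d.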

Dates and versions

hal-03089519 , version 1 (28-12-2020)

Identifiers

Cite

Joon Kwon, Vianney Perchet, Claire Vernade. Sparse Stochastic Bandits. Conference on Learning Theory, Jul 2017, Amsterdam, Netherlands. ⟨hal-03089519⟩