Infinitely many-armed bandits

Yizao Wang; Jean-Yves Audibert; Rémi Munos

Conference paper, Year: 2008


Yizao Wang

Role: Author

Centre de Mathématiques Appliquées - Ecole Polytechnique

Jean-Yves Audibert

Role: Author
PersonId: 931557

imagine [Marne-la-Vallée]

Laboratoire d'Informatique Gaspard-Monge

Statistical Machine Learning and Parsimony

Rémi Munos

Role: Author
PersonId: 836863

Sequential Learning

Abstract

We consider multi-armed bandit problems where the number of arms is larger than the possible number of experiments. We make a stochastic assumption on the mean reward of a newly selected arm, which characterizes its probability of being a near-optimal arm. Our assumption is weaker than in previous works. We describe algorithms based on upper confidence bounds applied to a restricted set of randomly selected arms and provide upper bounds on the resulting expected regret. We also derive a lower bound which matches (up to a logarithmic factor) the upper bound in some cases.
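The idea of running an upper-confidence-bound policy on a restricted, randomly selected set of arms can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the abstract: rewards are Bernoulli, the hypothetical callable `draw_arm_mean` stands in for the stochastic assumption on the mean reward of a new arm, the default subset size `ceil(sqrt(n_rounds))` is an arbitrary choice, and the index is the standard UCB1 bonus rather than the exploration term analyzed by the authors.

```python
import math
import random


def ucb_on_random_subset(n_rounds, draw_arm_mean, n_arms=None, seed=0):
    """Run a UCB1-style policy on a randomly drawn, fixed subset of arms.

    draw_arm_mean is a hypothetical callable (rng -> float in [0, 1]) that
    returns the mean reward of a freshly drawn arm.
    """
    rng = random.Random(seed)
    # Assumption of this sketch: subset size defaults to ceil(sqrt(n_rounds)).
    k = n_arms if n_arms is not None else math.ceil(math.sqrt(n_rounds))
    k = max(1, min(k, n_rounds))

    # Draw k arms at random from the (conceptually infinite) pool.
    means = [draw_arm_mean(rng) for _ in range(k)]
    counts = [0] * k   # number of pulls per selected arm
    sums = [0.0] * k   # cumulative reward per selected arm
    total_reward = 0.0

    for t in range(1, n_rounds + 1):
        if t <= k:
            arm = t - 1  # initialization: pull each selected arm once
        else:
            # UCB1 index: empirical mean plus exploration bonus.
            arm = max(
                range(k),
                key=lambda i: sums[i] / counts[i]
                + math.sqrt(2.0 * math.log(t) / counts[i]),
            )
        # Bernoulli reward with the chosen arm's mean (assumption).
        reward = 1.0 if rng.random() < means[arm] else 0.0
        counts[arm] += 1
        sums[arm] += reward
        total_reward += reward

    return means, counts, total_reward
```

For example, `ucb_on_random_subset(1000, lambda rng: rng.random())` draws arm means uniformly on [0, 1] and spends all 1000 pulls on the selected subset; comparing `total_reward` with `1000 * max(means)` gives a rough sense of the regret relative to the best selected arm only, not to the best arm in the full pool.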

Domains

Machine Learning [cs.LG]

Main file

many-armed.pdf (145.07 KB)

Origin: Files produced by the author(s)

Contributor: Rémi Munos

https://hal.science/hal-00830178

Submitted on: Tuesday, June 4, 2013, 15:19:36

Last modified on: Friday, April 19, 2024, 16:18:57

Long-term archiving on: Thursday, September 5, 2013, 04:23:06

Dates and versions

hal-00830178 , version 1 (04-06-2013)

Identifiers

HAL Id: hal-00830178, version 1

Cite

Yizao Wang, Jean-Yves Audibert, Rémi Munos. Infinitely many-armed bandits. Advances in Neural Information Processing Systems, 2008, Canada. ⟨hal-00830178⟩


Collections

X ENS-PARIS ENPC UNIV-LILLE3 CNRS INRIA UNIV-MLV LIGM_A3SI X-CMAP X-DEP-MATHA PARISTECH LAGIS LIGM IMAGINE CMAP INRIA2 PSL ESIEE-PARIS UNIV-EIFFEL JSE2024

