Optimal Best Arm Identification with Fixed Confidence

Aurélien Garivier; Emilie Kaufmann

Communication Dans Un Congrès Année : 2016

Optimal Best Arm Identification with Fixed Confidence

(1) , (2, 3, 4)

1
2
3
4

Aurélien Garivier

Fonction : Auteur
PersonId : 4986
IdHAL : aurelien-garivier
ORCID : 0000-0002-4906-9573
IdRef : 111902495

Institut de Mathématiques de Toulouse UMR5219

Emilie Kaufmann

Fonction : Auteur
PersonId : 10422
IdHAL : emilie-kaufmann
ORCID : 0000-0002-5496-824X
IdRef : 197040810

Centre National de la Recherche Scientifique

Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189

Sequential Learning

Résumé

We give a complete characterization of the complexity of best-arm identification in one-parameter bandit problems. We prove a new, tight lower bound on the sample complexity. We propose the `Track-and-Stop' strategy, which we prove to be asymptotically optimal. It consists in a new sampling rule (which tracks the optimal proportions of arm draws highlighted by the lower bound) and in a stopping rule named after Chernoff, for which we give a new analysis.

Mots clés

MDL best arm identification multi-armed bandits

Domaines

Statistiques [math.ST] Machine Learning [stat.ML]

Fichier principal

MDLBAI.pdf (323.3 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Aurélien Garivier : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01273838

Soumis le : mercredi 1 juin 2016-14:25:34

Dernière modification le : lundi 22 avril 2024-10:17:33

Archivage à long terme le : vendredi 2 septembre 2016-10:27:56

Dates et versions

hal-01273838 , version 1 (14-02-2016)

hal-01273838 , version 2 (01-06-2016)

Identifiants

HAL Id : hal-01273838 , version 2
ARXIV : 1602.04589

Citer

Aurélien Garivier, Emilie Kaufmann. Optimal Best Arm Identification with Fixed Confidence. 29th Annual Conference on Learning Theory (COLT), Jun 2016, New York, United States. ⟨hal-01273838v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-TLSE2 CNRS INRIA INSA-TOULOUSE IMT UT1-CAPITOLE CRISTAL INRIA2 CRISTAL-SEQUEL UNIV-LILLE INSA-GROUPE INSA-TOULOUSE-GEI ANR UNIV-UT3 UT3-TOULOUSEINP

667 Consultations

342 Téléchargements

Optimal Best Arm Identification with Fixed Confidence

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager