Non-Asymptotic Sequential Tests for Overlapping Hypotheses and application to near optimal arm identification in bandit models

Aurélien Garivier; Emilie Kaufmann

Journal article. Sequential Analysis. Year: 2021


Aurélien Garivier

Role: Author
PersonId: 4986
IdHAL: aurelien-garivier
ORCID: 0000-0002-4906-9573
IdRef: 111902495

Unité de Mathématiques Pures et Appliquées

Modèles de calcul, Complexité, Combinatoire

Emilie Kaufmann

Role: Author
PersonId: 10422
IdHAL: emilie-kaufmann
ORCID: 0000-0002-5496-824X
IdRef: 197040810

Scool

Centre National de la Recherche Scientifique

Abstract

In this paper, we study sequential testing problems with overlapping hypotheses. We first focus on the simple problem of assessing whether the mean $\mu$ of a Gaussian distribution is $\geq -\varepsilon$ or $\leq \varepsilon$; if $\mu \in (-\varepsilon, \varepsilon)$, both answers are considered correct. Then, we consider PAC best arm identification in a bandit model: given $K$ probability distributions on $\mathbb{R}$ with means $\mu_1, \dots, \mu_K$, we derive the asymptotic complexity of identifying, with risk at most $\delta$, an index $I \in \{1, \dots, K\}$ such that $\mu_I \geq \max_i \mu_i - \varepsilon$. We provide non-asymptotic bounds on the error of a parallel Generalized Likelihood Ratio Test, which can also be used for more general testing problems. We further propose lower bounds on the number of observations needed to identify a correct hypothesis. These lower bounds rely on information-theoretic arguments, and specifically on two versions of a change-of-measure lemma (a high-level form and a low-level form) whose relative merits are discussed.
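To make the first problem in the abstract concrete, here is a minimal Python sketch of a parallel likelihood-ratio test for a Gaussian mean with known standard deviation. It is an illustration under stated assumptions, not the authors' procedure: the function names, the `sigma` and `max_n` parameters, and in particular the `threshold` function are hypothetical. The threshold is a simple placeholder, not the calibrated one analyzed in the paper, so the sketch carries no guarantee that the risk stays below $\delta$.

```python
import math
import random


def threshold(n, delta):
    # Placeholder stopping threshold (an assumption for illustration only);
    # it is NOT the calibrated threshold from the paper.
    return math.log(1.0 / delta) + 2.0 * math.log(1.0 + math.log(n))


def parallel_glrt(sample, eps, delta, sigma=1.0, max_n=100_000):
    """Run two one-sided GLR tests in parallel on draws from `sample()`.

    Returns (answer, n), where answer is ">= -eps", "<= eps", or None
    if no decision is reached within max_n observations.
    """
    total = 0.0
    for n in range(1, max_n + 1):
        total += sample()
        mean = total / n
        beta = threshold(n, delta)
        # Gaussian GLR statistic for rejecting {mu <= -eps}.
        reject_low = n * max(mean + eps, 0.0) ** 2 / (2.0 * sigma ** 2)
        # Gaussian GLR statistic for rejecting {mu >= eps}.
        reject_high = n * max(eps - mean, 0.0) ** 2 / (2.0 * sigma ** 2)
        if reject_low > beta and reject_low >= reject_high:
            return ">= -eps", n
        if reject_high > beta:
            return "<= eps", n
    return None, max_n


# Usage: observations drawn from N(0.5, 1) with a seeded generator.
rng = random.Random(0)
answer, n_used = parallel_glrt(lambda: rng.gauss(0.5, 1.0), eps=0.1, delta=0.05)
print(answer, n_used)
```

Because the two hypotheses overlap on $(-\varepsilon, \varepsilon)$, either answer is acceptable when $\mu$ lies in that interval, and for $\varepsilon > 0$ at least one of the two statistics grows linearly with the number of observations.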

Keywords

Sequential statistics; bandit models; tests; Best arm identification; Generalized Likelihood Ratio test; Multi-armed bandits; Sequential testing

Domains

Statistics [math.ST]; Machine Learning [stat.ML]

Main file

GK_SQA.pdf (331.76 KB)

Origin: Files produced by the author(s)

https://hal.science/hal-02123833

Submitted on: Wednesday, 17 November 2021, 22:56:09

Last modified on: Thursday, 14 March 2024, 03:14:49

Dates and versions

hal-02123833, version 1 (9 May 2019)

hal-02123833, version 2 (17 November 2021)

Identifiers

HAL Id: hal-02123833, version 2
arXiv: 1905.03495

Cite

Aurélien Garivier, Emilie Kaufmann. Non-Asymptotic Sequential Tests for Overlapping Hypotheses and application to near optimal arm identification in bandit models. Sequential Analysis, 2021. ⟨hal-02123833v2⟩

Collections

ENS-LYON CNRS INRIA UNIV-LYON1 CRISTAL INRIA2 UNIV-LILLE UDL CRISTAL-SCOOL
