Statistically Efficient, Polynomial-Time Algorithms for Combinatorial Semi-Bandits - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2021

Statistically Efficient, Polynomial-Time Algorithms for Combinatorial Semi-Bandits

Thibaut Cuvelier
Richard Combes
Eric Gourdin
  • Fonction : Auteur
  • PersonId : 952670

Résumé

We consider combinatorial semi-bandits over a set X ⊂ {0, 1} d where rewards are uncorrelated across items. For this problem, the algorithm ESCB yields the smallest known regret bound R(T) = O d (ln m) 2 (ln T) ∆ min after T rounds, where m = max x ∈X 1 ⊤ x. However, ESCB has computational complexity O(|X|), which is typically exponential in d, and cannot be used in large dimensions. We propose the first algorithm that is both computationally and statistically efficient for this problem with regret R(T) = O d (ln m) 2 (ln T) ∆ min and computational asymptotic complexity O(δ −1 T poly(d)), where δ T is a function which vanishes arbitrarily slowly. Our approach involves carefully designing AESCB, an approximate version of ESCB with the same regret guarantees. We show that, whenever budgeted linear maximization over X can be solved up to a given approximation ratio, AESCB is implementable in polynomial time O(δ −1 T poly(d)) by repeatedly maximizing a linear function over X subject to a linear budget constraint, and showing how to solve these maximization problems efficiently. Additional algorithms, proofs and numerical experiments are given in the complete version of this work.
Fichier principal
Vignette du fichier
sigmet297-cuvelierA.pdf (663.76 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03201526 , version 1 (19-04-2021)

Identifiants

Citer

Thibaut Cuvelier, Richard Combes, Eric Gourdin. Statistically Efficient, Polynomial-Time Algorithms for Combinatorial Semi-Bandits. SIGMETRICS 2021, ACM, Jun 2021, Virtual Event, China. ⟨10.1145/3410220.3453926⟩. ⟨hal-03201526⟩
76 Consultations
64 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More