Covariance-adapting algorithm for semi-bandits with application to sparse outcomes - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

Covariance-adapting algorithm for semi-bandits with application to sparse outcomes

Michal Valko

Résumé

We investigate stochastic combinatorial semi-bandits, where the entire joint distribution of outcomes impacts the complexity of the problem instance (unlike in the standard bandits). Typical distributions considered depend on specific parameter values, whose prior knowledge is required in theory but quite difficult to estimate in practice; an example is the commonly assumed sub-Gaussian family. We alleviate this issue by instead considering a new general family of sub-exponential distributions, which contains bounded and Gaussian ones. We prove a new lower bound on the regret on this family, that is parameterized by the unknown covariance matrix, a tighter quantity than the sub-Gaussian matrix. We then construct an algorithm that uses covariance estimates, and provide a tight asymptotic analysis of the regret. Finally, we apply and extend our results to the family of sparse outcomes, which has applications in many recommender systems.
Fichier principal
Vignette du fichier
colt.pdf (660.04 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02876102 , version 1 (20-06-2020)

Identifiants

  • HAL Id : hal-02876102 , version 1

Citer

Pierre Perrault, Vianney Perchet, Michal Valko. Covariance-adapting algorithm for semi-bandits with application to sparse outcomes. Conference on Learning Theory, 2020, Graz, Austria. ⟨hal-02876102⟩
95 Consultations
183 Téléchargements

Partager

Gmail Facebook X LinkedIn More