Stochastic bandits with vector losses: Minimizing $\ell^\infty$-norm of relative losses

Xuedong Shang; Han Shao; Jian Qian

Pré-Publication, Document De Travail Année : 2020

Stochastic bandits with vector losses: Minimizing $\ell^\infty$-norm of relative losses

(1, 2) , (3) , (4)

1
2
3
4

Xuedong Shang

Fonction : Auteur
PersonId : 21274
IdHAL : xuedong-shang
ORCID : 0000-0002-1537-6540
IdRef : 259274526

Scool

Sequential Learning

Han Shao

Fonction : Auteur

Toyota Technological Institute

Jian Qian

Fonction : Auteur

Massachusetts Institute of Technology

Résumé

Multi-armed bandits are widely applied in scenarios like recommender systems, for which the goal is to maximize the click rate. However, more factors should be considered, e.g., user stickiness, user growth rate, user experience assessment, etc. In this paper, we model this situation as a problem of K-armed bandit with multiple losses. We define relative loss vector of an arm where the i-th entry compares the arm and the optimal arm with respect to the i-th loss. We study two goals: (a) finding the arm with the minimum $\ell^\infty$-norm of relative losses with a given confidence level (which refers to fixed-confidence best-arm identification); (b) minimizing the $\ell^\infty$-norm of cumulative relative losses (which refers to regret minimization). For goal (a), we derive a problem-dependent sample complexity lower bound and discuss how to achieve matching algorithms. For goal (b), we provide a regret lower bound of Ω(T 2/3) and provide a matching algorithm.

Domaines

Apprentissage [cs.LG]

Fichier principal

shang2020vector.pdf (451.55 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Xuedong Shang : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02968536

Soumis le : jeudi 15 octobre 2020-18:36:49

Dernière modification le : mercredi 24 janvier 2024-09:54:23

Dates et versions

hal-02968536 , version 1 (15-10-2020)

Identifiants

HAL Id : hal-02968536 , version 1

Citer

Xuedong Shang, Han Shao, Jian Qian. Stochastic bandits with vector losses: Minimizing $\ell^\infty$-norm of relative losses. 2020. ⟨hal-02968536⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA CRISTAL INRIA2 CRISTAL-SEQUEL UNIV-LILLE CRISTAL-SCOOL

77 Consultations

86 Téléchargements

Stochastic bandits with vector losses: Minimizing $\ell^\infty$-norm of relative losses

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager