Diversity-Preserving K-Armed Bandits, Revisited

We consider the bandit-based framework for diversity-preserving recommendations introduced by Celis et al. (2019), who approached it in the case of a polytope mainly by a reduction to the setting of linear bandits. We design a UCB algorithm using the specific structure of the setting and show that it enjoys a bounded distribution-dependent regret in the natural cases when the optimal mixed actions put some probability mass on all actions (i.e., when diversity is desirable). The regret lower bounds provided show that otherwise, at least when the model is mean-unbounded, a regret is suffered. We also discuss an example beyond the special case of polytopes.

Domains

Machine Learning [stat.ML]

Fichier principal

GHLS24--Diversity-preserving.pdf (399.38 Ko)

Origin : Files produced by the author(s)

Gilles Stoltz : Connect in order to contact the contributor

https://hal.science/hal-02957485

Submitted on : Friday, April 5, 2024-9:33:52 AM

Last modification on : Saturday, April 27, 2024-3:15:02 AM

Dates and versions

hal-02957485 , version 1 (05-10-2020)

hal-02957485 , version 2 (05-04-2024)

Licence

Attribution

Identifiers

HAL Id : hal-02957485 , version 2
ARXIV : 2010.01874

Cite

Hédi Hadiji, Sébastien Gerchinovitz, Jean-Michel Loubes, Gilles Stoltz. Diversity-Preserving K-Armed Bandits, Revisited. 2024. ⟨hal-02957485v2⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

HEC UNIV-TLSE2 CNRS INRIA INSA-TOULOUSE SUP_LSS INSMI SUP_TELECOMS IMT LM-ORSAY UT1-CAPITOLE CENTRALESUPELEC INRIA2 UNIV-PARIS-SACLAY IRT_SAINT-EXUPERY INSA-GROUPE ANR ANITI GS-MATHEMATIQUES GS-COMPUTER-SCIENCE GS-SPORT-HUMAN-MOVEMENT UNIV-UT3 UT3-TOULOUSEINP

310 View

105 Download