R-UCB: a Contextual Bandit Algorithm for Risk-Aware Recommender Systems - HAL open archive
Preprint, Working Paper. Year: 2014

R-UCB: a Contextual Bandit Algorithm for Risk-Aware Recommender Systems

Abstract

Mobile Context-Aware Recommender Systems can be naturally modelled as an exploration/exploitation trade-off (exr/exp) problem, where the system has to choose between maximizing its expected reward given its current knowledge (exploitation) and learning more about the user's unknown preferences to improve that knowledge (exploration). This problem has been addressed by the reinforcement learning community, but existing approaches do not consider the risk level of the user's current situation: when the risk level is high, it may be dangerous to recommend items the user does not want in that situation. In this paper we introduce an algorithm named R-UCB that considers the risk level of the user's situation to adaptively balance exr and exp. A detailed analysis of the experimental results reveals several important findings about the exr/exp behaviour.
Main file
R-UCB_a_Contextual_Bandit_Algorithm_for_Risk-Aware_Recommender_Systems.pdf (815.65 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01054183, version 1 (05-08-2014)

Identifiers

  • HAL Id: hal-01054183, version 1

Cite

Djallel Bouneffouf. R-UCB: a Contextual Bandit Algorithm for Risk-Aware Recommender Systems. 2014. ⟨hal-01054183⟩
143 views
374 downloads
