R-UCB: a Contextual Bandit Algorithm for Risk-Aware Recommender Systems

Djallel Bouneffouf

Pré-Publication, Document De Travail Année : 2014

R-UCB: a Contextual Bandit Algorithm for Risk-Aware Recommender Systems

(1, 2, 3)

1
2
3

Djallel Bouneffouf

Fonction : Auteur
PersonId : 931776

Département Informatique

Services répartis, Architectures, MOdélisation, Validation, Administration des Réseaux

Centre National de la Recherche Scientifique

Résumé

Mobile Context-Aware Recommender Systems can be naturally modelled as an exploration/exploitation trade-o ff (exr/exp) problem, where the system has to choose between maximizing its expected rewards dealing with its current knowledge (exploitation) and learning more about the unknown user's preferences to improve its knowledge (exploration). This problem has been addressed by the reinforcement learning community but they do not consider the risk level of the current user's situation, where it may be dangerous to recommend items the user may not desire in her current situation if the risk level is high. We introduce in this paper an algorithm named R-UCB that considers the risk level of the user's situation to adaptively balance between exr and exp. The detailed analysis of the experimental results reveals several important discoveries in the exr/exp behaviour.

Mots clés

Contextual Bandit Recommender Systems Risk-Aware

Domaines

Intelligence artificielle [cs.AI]

Fichier principal

R-UCB_a_Contextual_Bandit_Algorithm_for_Risk-Aware_Recommender_Systems.pdf (815.65 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Djallel Bouneffouf : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01054183

Soumis le : mardi 5 août 2014-12:14:38

Dernière modification le : vendredi 24 mars 2023-14:52:59

Archivage à long terme le : mercredi 26 novembre 2014-00:30:51

Dates et versions

hal-01054183 , version 1 (05-08-2014)

Identifiants

HAL Id : hal-01054183 , version 1

Citer

Djallel Bouneffouf. R-UCB: a Contextual Bandit Algorithm for Risk-Aware Recommender Systems. 2014. ⟨hal-01054183⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSTITUT-TELECOM CNRS TELECOM-SUDPARIS

143 Consultations

374 Téléchargements

R-UCB: a Contextual Bandit Algorithm for Risk-Aware Recommender Systems

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager