Bandits and Recommender Systems

Jérémie Mary; Romaric Gaudel; Philippe Preux

doi:10.1007/978-3-319-27926-8_29

Communication Dans Un Congrès Année : 2016

Bandits and Recommender Systems

(1, 2) , (1, 2) , (1, 2)

1
2

Jérémie Mary

Fonction : Auteur
PersonId : 740984
IdHAL : jeremie-mary

Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189

Sequential Learning

Romaric Gaudel

Fonction : Auteur

Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189

Sequential Learning

Philippe Preux

Fonction : Auteur
PersonId : 5488
IdHAL : preux-philippe
IdRef : 059896353

Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189

Sequential Learning

Résumé

This paper addresses the on-line recommendation problem facing new users and new items; we assume that no information is available neither about users, nor about the items. The only source of information is a set of ratings given by users to some items. By on-line, we mean that the set of users, and the set of items, and the set of ratings is evolving along time and that at any moment, the recommendation system has to select items to recommend based on the currently available information, that is basically the sequence of past events. We also mean that each user comes with her preferences which may evolve along short and longer scales of time; so we have to continuously update their preferences. When the set of ratings is the only available source of information , the traditional approach is matrix factorization. In a decision making under uncertainty setting, actions should be selected to balance exploration with exploitation; this is best modeled as a bandit problem. Matrix factors provide a latent representation of users and items. These representations may then be used as contextual information by the bandit algorithm to select items. This last point is exactly the originality of this paper: the combination of matrix factorization and bandit algorithms to solve the on-line recommendation problem. Our work is driven by considering the recommendation problem as a feedback controlled loop. This leads to interactions between the representation learning, and the recommendation policy.

Mots clés

Recommender Systems sequential Recommender Systems Collaborative Filtering Matrix Factorization Multi-Armed Bandtis contextual Bandits

Domaines

Apprentissage [cs.LG]

Fichier principal

Bandits_and_Recommender_Systems.pdf (1.5 Mo)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Romaric Gaudel : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01256033

Soumis le : jeudi 14 janvier 2016-11:50:31

Dernière modification le : mercredi 24 janvier 2024-09:54:23

Archivage à long terme le : vendredi 11 novembre 2016-05:26:16

Dates et versions

hal-01256033 , version 1 (14-01-2016)

Identifiants

HAL Id : hal-01256033 , version 1
DOI : 10.1007/978-3-319-27926-8_29

Citer

Jérémie Mary, Romaric Gaudel, Philippe Preux. Bandits and Recommender Systems. First International Workshop on Machine Learning, Optimization, and Big Data (MOD'15), Jul 2015, Taormina, Italy. pp.325-336, ⟨10.1007/978-3-319-27926-8_29⟩. ⟨hal-01256033⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA CRISTAL INRIA2 CRISTAL-SEQUEL UNIV-LILLE

255 Consultations

3060 Téléchargements

Bandits and Recommender Systems

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager