Spectral Bandits for Smooth Graph Functions

Michal Valko; Rémi Munos; Branislav Kveton; Tomáš Kocák

Communication Dans Un Congrès Année : 2014

Spectral Bandits for Smooth Graph Functions

(1) , (2, 1) , (3) , (1)

1
2
3

Michal Valko

Fonction : Auteur
PersonId : 284
IdHAL : michal
IdRef : 22360934X

Sequential Learning

Rémi Munos

Fonction : Auteur
PersonId : 836863

Microsoft Research [Cambridge]

Sequential Learning

Branislav Kveton

Fonction : Auteur

Technicolor Research [Palo Alto]

Tomáš Kocák

Fonction : Auteur
PersonId : 955512

Sequential Learning

Résumé

Smooth functions on graphs have wide applications in manifold and semi-supervised learning. In this paper, we study a bandit problem where the payoffs of arms are smooth on a graph. This framework is suitable for solving online learning problems that involve graphs, such as content-based recommendation. In this problem, each item we can recommend is a node and its expected rating is similar to its neighbors. The goal is to recommend items that have high expected ratings. We aim for the algorithms where the cumulative regret with respect to the optimal policy would not scale poorly with the number of nodes. In particular, we introduce the notion of an effective dimension, which is small in real-world graphs, and propose two algorithms for solving our problem that scale linearly and sublinearly in this dimension. Our experiments on real-world content recommendation problem show that a good estimator of user preferences for thousands of items can be learned from just tens of nodes evaluations.

Domaines

Machine Learning [stat.ML] Bibliothèque électronique [cs.DL]

Fichier principal

valko2014spectral.pdf (448.56 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Michal Valko : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00986818

Soumis le : mardi 20 mai 2014-00:24:34

Dernière modification le : vendredi 24 mars 2023-14:52:58

Archivage à long terme le : mercredi 20 août 2014-10:55:14

Dates et versions

hal-00986818 , version 1 (05-05-2014)

hal-00986818 , version 2 (16-05-2014)

hal-00986818 , version 3 (20-05-2014)

Identifiants

HAL Id : hal-00986818 , version 3

Citer

Michal Valko, Rémi Munos, Branislav Kveton, Tomáš Kocák. Spectral Bandits for Smooth Graph Functions. International Conference on Machine Learning, May 2014, Beijing, China. ⟨hal-00986818v3⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-LILLE3 CNRS INRIA LAGIS CRISTAL INRIA2 CRISTAL-SEQUEL

519 Consultations

556 Téléchargements

Spectral Bandits for Smooth Graph Functions

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager