Graph Convolutional Network Upper Confident Bound
Résumé
We formulate a new problem at the intersection of semi-supervised learning and contextual bandits, motivated by several applications including clinical trials and ad recommendations. We demonstrate how Graph Convolutional Network (GCN), a semi-supervised learning approach, can be adjusted to the new problem formulation. We also propose a variant of the linear contextual bandit with semi-supervised missing rewards imputation. We then take the best of both approaches to develop multi-GCN embedded contextual bandit. Our algorithms are verified on several real world datasets.
Domaines
Informatique et langage [cs.CL]
Fichier principal
Online_Semi_Supervised_Learning_with_Bandit_Feedback.pdf (671.15 Ko)
Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...