Graph Convolutional Network Upper Confident Bound - Archive ouverte HAL Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2020

Graph Convolutional Network Upper Confident Bound

Mikhail Yurochkin
  • Fonction : Auteur
Mayank Agarwal
  • Fonction : Auteur
Yasaman Khazaeni
  • Fonction : Auteur
Djallel Bouneffouf
  • Fonction : Auteur
  • PersonId : 931776

Résumé

We formulate a new problem at the intersection of semi-supervised learning and contextual bandits, motivated by several applications including clinical trials and ad recommendations. We demonstrate how Graph Convolutional Network (GCN), a semi-supervised learning approach, can be adjusted to the new problem formulation. We also propose a variant of the linear contextual bandit with semi-supervised missing rewards imputation. We then take the best of both approaches to develop multi-GCN embedded contextual bandit. Our algorithms are verified on several real world datasets.
Fichier principal
Vignette du fichier
Online_Semi_Supervised_Learning_with_Bandit_Feedback.pdf (671.15 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02996997 , version 1 (09-11-2020)

Identifiants

  • HAL Id : hal-02996997 , version 1

Citer

Sohini Upadhyay, Mikhail Yurochkin, Mayank Agarwal, Yasaman Khazaeni, Djallel Bouneffouf. Graph Convolutional Network Upper Confident Bound. 2020. ⟨hal-02996997⟩
61 Consultations
141 Téléchargements

Partager

Gmail Facebook X LinkedIn More