Combining Semantic and Linguistic Representations for Media Recommendation - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Multimedia Systems Année : 2022

Combining Semantic and Linguistic Representations for Media Recommendation

Ismail Harrando
  • Fonction : Auteur
  • PersonId : 1135798
Raphaël Troncy

Résumé

Content-based recommendation systems offer the possibility of promoting media (e.g. posts, videos, podcasts) to users based solely on a representation of the content (i.e. without using any user-related data such as views or interactions between users and items). In this work, we study the potential of using different textual representations (based on the content of the media) and semantic representations (created from a knowledge graph of media metadata). We also show that by using off-the-shelf automatic annotation tools from the Information Extraction literature, we can improve recommendation performance, without any extra cost of training, data collection or annotation. We first evaluate multiple textual content representations on two tasks of recommendation: user-specific, which is performed by suggesting new items to the user given a history of interactions, and item-based, which is based solely on content relatedness, and is rarely investigated in the literature of recommender systems. We compare how using automatically extracted content (via ASR) compares to using human-written summaries. We then derive a semantic content representation by combining manually created metadata and automatically extracted annotations and we show that Knowledge Graphs, through their embeddings, constitute a great modality to seamlessly integrate extracted knowledge to legacy metadata and can be used to provide good content recommendations. We finally study how combining both semantic and textual representations can lead to superior performance on both recommendation tasks. Our code is available at https: //github.com/D2KLab/ka-recsys to support experiment reproducibility.
Fichier principal
Vignette du fichier
publi-6916.pdf (475.77 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03685011 , version 1 (01-06-2022)

Identifiants

  • HAL Id : hal-03685011 , version 1

Citer

Ismail Harrando, Raphaël Troncy. Combining Semantic and Linguistic Representations for Media Recommendation. Multimedia Systems, In press, Data-driven Personalization of television content. ⟨hal-03685011⟩

Collections

EURECOM ANR
21 Consultations
65 Téléchargements

Partager

Gmail Facebook X LinkedIn More