Dynamic Data Capture from Social Media Streams: A Contextual Bandit Approach.

Thibault Gisselbrecht 1, 2, * Sylvain Lamprier 2 Patrick Gallinari 2
* Auteur correspondant
2 MLIA - Machine Learning and Information Access
LIP6 - Laboratoire d'Informatique de Paris 6
Abstract : Social Media usually provide streaming data access that enable dynamic capture of the social activity of their users. Leveraging such APIs for collecting social data that satisfy a given pre-defined need may constitute a complex task, that implies careful stream selections. With user-centered streams, it indeed comes down to the problem of choosing which users to follow in order to maximize the utility of the collected data w.r.t. the need. On large social media, this represents a very challenging task due to the huge number of potential targets and restricted access to the data. Because of the intrinsic non-stationarity of user's behavior, a relevant target today might be irrelevant tomorrow, which represents a major difficulty to apprehend. In this paper, we propose a new approach that anticipates which profiles are likely to publish relevant contents-given a predefined need-in the future, and dynamically selects a subset of accounts to follow at each iteration. Our method has the advantage to take into account both API restrictions and the dynamics of users' behaviors. We formalize the task as a contextual bandit problem with multiple actions selection. We finally conduct experiments on Twitter, which demonstrate the empirical effectiveness of our approach in real-world settings.
Type de document :
Communication dans un congrès
Tenth International Conference on Web and Social Media, ICWSM 2016, May 2016, Cologne, Germany. pp.130-139, 2016
Liste complète des métadonnées

Littérature citée [21 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-01355400
Contributeur : Thibault Gisselbrecht <>
Soumis le : mardi 23 août 2016 - 12:03:25
Dernière modification le : jeudi 22 novembre 2018 - 14:31:13

Identifiants

  • HAL Id : hal-01355400, version 1

Citation

Thibault Gisselbrecht, Sylvain Lamprier, Patrick Gallinari. Dynamic Data Capture from Social Media Streams: A Contextual Bandit Approach.. Tenth International Conference on Web and Social Media, ICWSM 2016, May 2016, Cologne, Germany. pp.130-139, 2016. 〈hal-01355400〉

Partager

Métriques

Consultations de la notice

249