Users Are Known by the Company They Keep: Topic Models for Viewpoint Discovery in Social Networks - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2017

Users Are Known by the Company They Keep: Topic Models for Viewpoint Discovery in Social Networks

Résumé

Social media platforms such as weblogs and social networking sites provide Internet users with an unprecedented means to express their opinions and debate on a wide range of issues. Concurrently with their growing importance in public communication, social media platforms may foster echo chambers and filter bubbles: homophily and content personalization lead users to be increasingly exposed to conforming opinions. There is therefore a need for unbiased systems able to identify and provide access to varied viewpoints. To address this task, we propose in this paper a novel unsupervised topic model, the Social Network Viewpoint Discovery Model (SNVDM). Given a specific issue (e.g., U.S. policy) as well as the text and social interactions from the users discussing this issue on a social networking site, SNVDM jointly identifies the issue's topics, the users' viewpoints, and the discourse pertaining to the different topics and viewpoints. In order to overcome the potential sparsity of the social network (i.e., some users interact with only a few other users), we propose an extension to SNVDM based on the Generalized Pólya Urn sampling scheme (SNVDM-GPU) to leverage "acquaintances of acquaintances" relationships. We benchmark the different proposed models against three baselines, namely TAM, SN-LDA, and VODUM, on a viewpoint clustering task using two real-world datasets. We thereby provide evidence that our model SNVDM and its extension SNVDM-GPU significantly outperform state-of-the-art baselines, and we show that utilizing social interactions greatly improves viewpoint clustering performance.
Fichier principal
Vignette du fichier
thonet_22090.pdf (283.15 Ko) Télécharger le fichier
thonet_22090_bis.pdf (2.6 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02611113 , version 1 (18-05-2020)

Identifiants

Citer

Thibaut Thonet, Guillaume Cabanac, Mohand Boughanem, Karen Pinel-Sauvagnat. Users Are Known by the Company They Keep: Topic Models for Viewpoint Discovery in Social Networks. International Conference on Information and Knowledge Management (CIKM 2017), ACM SIGWEB: Special Interest Group on Hypertext, Hypermedia and Web; ACM SIGIR: Special Interest Group on Information Retrieval, Nov 2017, Singapore, Singapore. pp.87--96, ⟨10.1145/3132847.3132897⟩. ⟨hal-02611113⟩
77 Consultations
59 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More