Skip to Main content Skip to Navigation
Conference papers

Efficient Filtering in Micro-blogging Systems: We Won't Get Flooded Again

Ryadh Dahimene 1 Cedric Du Mouza 1 Michel Scholl 2
1 CEDRIC - ISID - CEDRIC. Ingénierie des Systèmes d'Information et de Décision
CEDRIC - Centre d'études et de recherche en informatique et communications
2 CEDRIC - VERTIGO - CEDRIC. Bases de données avancées
CEDRIC - Centre d'études et de recherche en informatique et communications
Abstract : In the last years, micro-blogging systems have encountered a large success. Twitter for instance claims more than 200 million accounts after 5 years of existence with more than 200 million tweets a day leading to 350 billion delivered tweets. Micro-blogging systems rely on the all-or-nothing paradigm: a user receives all the posts from an account s/he follows. A consequence for a user is the risk of flooding, i.e., the number of posts received from all the accounts s/he follows implies a time-consuming scan of his list of postings to read news that match his interests. Meanwhile these systems receive all posts and deliver each of them to all the followers of the publishing accounts, whether they are interested by the news or not. To avoid user flooding and to significantly diminish the number of posts to be delivered, we propose in this paper three filtering structures for micro-blogging systems. They allow to efficiently retrieve the followers of an account that could be interested by a post s/he published. We compare analytically these structures and confirm our analysis experimentally on synthetical datasets and on a real Twitter dataset which consists of more than 2.1 million users, 15.7 million tweets and 148.5 million publisher-follower relationships.
Keywords : filter index Twitter
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-01126111
Contributor : Laboratoire Cedric <>
Submitted on : Friday, March 6, 2015 - 11:40:30 AM
Last modification on : Monday, February 17, 2020 - 10:46:11 PM

Identifiers

  • HAL Id : hal-01126111, version 1

Collections

Citation

Ryadh Dahimene, Cedric Du Mouza, Michel Scholl. Efficient Filtering in Micro-blogging Systems: We Won't Get Flooded Again. Intl. IEEE Conf. on Scientific and Statistical Databases (SSDBM'12), Jun 2012, Chania, Greece. pp.168-176. ⟨hal-01126111⟩

Share

Metrics

Record views

41