[DD15] Filtering Structures for Microblogging Content
Revue Internationale avec comité de lecture :
Journal International Journal of Intelligent Information Technologies (IJIIT),
vol. 11(1),
pp. 30-51,
2015
Mots clés: Microblogging, Filtering, Indexing, Scalability issue
Résumé:
In the last years, microblogging systems have encountered a large success. After 7 years of existence Twitter claims more than 271 million active accounts leading to 500 million tweets per day. Microblogging systems rely on the all-or-nothing paradigm: a user receives all the posts from an account s/he follows. A consequence for a user is the risk of flooding, i.e., the number of posts received from all the accounts s/he follows implies a time-consuming scan of her/his feed to find relevant updates that match his interests. To avoid user flooding and to significantly diminish the number of posts to be delivered by the system, we propose to introduce filtering on top of microblogging systems. Driven by the shape of the data, we designed different filtering structures and compared them analytically as well as experimentally on a Twitter dataset which consists of more than 2.1 million users, 15.7 million tweets and 148.5 million publisher-follower relationships.