Best-Effort Refresh Strategies for Content-Based RSS Feed Aggregation - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2010

Best-Effort Refresh Strategies for Content-Based RSS Feed Aggregation

Roxana Horincar
  • Fonction : Auteur
  • PersonId : 976365
Bernd Amann

Résumé

During the past several years RSS-based content syndication has become a standard technique for efficiently and timely disseminating information on the web. From a data processing perspective RSS feeds are standard XML resources which are periodically refreshed by feed aggregators for generating continuous streams of items. In this article, we study the problem of information loss in the context of a content-based feed aggregation system and we propose a new best-effort refresh strategy for RSS feeds under limited bandwidth. This strategy is evaluated experimentally and compared to other state-of-the-art crawling strategies for web pages.

Dates et versions

hal-01292109 , version 1 (22-03-2016)

Identifiants

Citer

Roxana Horincar, Bernd Amann, Thierry Artières. Best-Effort Refresh Strategies for Content-Based RSS Feed Aggregation. The 11th international conference on Web information systems engineering (WISE 2010), Dec 2010, Hong Kong, China. pp.262-270, ⟨10.1007/978-3-642-17616-6_24⟩. ⟨hal-01292109⟩
48 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More