D. Cai, S. Yu, J. R. Wen, and W. Y. Ma, Vips: a vision-based page segmentation algorithm, 2003.

, CERN: The document that officially put the world wide web into the public domain, 1993.

D. Diminescu, e-Diasporas Atlas. Explorations and Cartography of Diasporas on Digital Networks. Ed. de la Maison des Sciences de l'Homme, 2012.

G. P. Fung, J. X. Yu, P. S. Yu, and H. Lu, Parameter free bursty events detection in text streams, Proceedings of the 31st international conference on Very large data bases, pp.181-192, 2005.

A. Jatowt, Y. Kawai, and K. Tanaka, Detecting age of page content, Proceedings of the 9th annual ACM international workshop on Web information and data management, pp.137-144, 2007.

B. Kahle, Preserving the internet, Scientific American pp, vol.276, pp.82-83, 1997.

C. Kohlschütter, P. Fankhauser, and W. Nejdl, Boilerplate detection using shallow text features, Proceedings of the Third ACM International Conference on Web Search and Data Mining, pp.441-450, 2010.

J. Masanès, Web Archiving, 2006.