Skip to Main content Skip to Navigation
Conference papers

Revealing Historical Events out of Web Archives

Quentin Lobbé 1, 2
1 VALDA - Value from Data
DI-ENS - Département d'informatique - ENS Paris, Inria de Paris
Abstract : As the living Web expands, worldwide volumes of Web archives constantly increase, making difficult to identify relevant archived contents. Here we propose an application for detecting historical events out of a corpus of Web archives and based on an entity called Web Fragment: a semantic and syntactic subset of a given Web page. The Web fragment has the particularity to be indexed by its edition date instead of its archiving date. We apply our framework on an archived Moroccan forum and witness how it reacted to the Arab Spring at the end of 2010.
Document type :
Conference papers
Complete list of metadata

Cited literature [8 references]  Display  Hide  Download
Contributor : Quentin Lobbé Connect in order to contact the contributor
Submitted on : Monday, October 15, 2018 - 4:16:19 PM
Last modification on : Friday, January 21, 2022 - 3:16:27 AM
Long-term archiving on: : Wednesday, January 16, 2019 - 3:49:31 PM


Files produced by the author(s)


  • HAL Id : hal-01895951, version 1


Quentin Lobbé. Revealing Historical Events out of Web Archives. 22nd International Conference on Theory and Practice of Digital Libraries (TPDL 2018), Sep 2018, Porto, Portugal. ⟨hal-01895951⟩



Les métriques sont temporairement indisponibles