Skip to Main content Skip to Navigation

Analyse des traces d'usage de Gallica : Une étude à partir des logs de connexions au site Gallica

Abstract : Gallica is one of the major digital libraries available for free via the Internet. In the context of the Bibli-Lab, research partnership between the Bibliothèque nationale de France and Télécom ParisTech, and with the support of TeraLab, a new analysis of Gallica servers’ connection logs was carried out, applying machine-learning methods to them. The aim was not to collect information on users or their profiles but rather to use logs, which act as records of usage, as a basis for identifying typical clickstreams. For 15 months (April 2016-July 2017), a researcher on postdoctoral contract and under the supervision of four of Télécom ParisTech’s research professors, developed a data clusterisation algorithm enabling grouping of Gallica sessions with similarities in sequencing and duration of actions . Logs analysed covered a range of durations, from a week to a month, with systematic checking of the stability of models obtained. The preferred methodological choice was to have statistical models dialogue with results obtained from other approaches (ethnographic observations, interviews, etc. ). Such dialogue enabled the researchers involved to: a) set departure parameters (definition of a session and the elementary actions composing it); b) check models obtained, which were highly sensitive to technical artefacts; and c) propose initial keys to interpretation.
Complete list of metadata

Cited literature [36 references]  Display  Hide  Download
Contributor : Philippe Chevallier <>
Submitted on : Wednesday, February 14, 2018 - 5:39:37 PM
Last modification on : Monday, January 25, 2021 - 4:02:02 PM
Long-term archiving on: : Monday, May 7, 2018 - 9:07:17 PM


rapport analyse des traces d'u...
Files produced by the author(s)


Distributed under a Creative Commons Attribution - NonCommercial - NoDerivatives 4.0 International License


  • HAL Id : hal-01709264, version 1



Adrien Nouvellet, Valérie Beaudouin, Florence d'Alché-Buc, Christophe Prieur, François Roueff. Analyse des traces d'usage de Gallica : Une étude à partir des logs de connexions au site Gallica. [Rapport de recherche] Télécom ParisTech; Bibliothèque nationale de France. 2017. ⟨hal-01709264⟩



Record views


Files downloads