
An Analytical Study of Large SPARQL Query Logs

Angela Bonifati 1, 2, Wim Martens 3, Thomas Timm 3
1 BD - Base de Données, LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
2 TYREX - Types and Reasoning for the Web, Inria Grenoble - Rhône-Alpes, LIG - Laboratoire d'Informatique de Grenoble
Abstract: With the adoption of RDF as the data model for Linked Data and the Semantic Web, query specification by end-users has become increasingly common at SPARQL endpoints. In this paper, we conduct an in-depth analytical study of the queries formulated by end-users and harvested from large and up-to-date structured query logs from a wide variety of RDF data sources. As opposed to previous studies, ours is the first assessment of a voluminous query corpus, spanning several years and covering many representative SPARQL endpoints. Apart from the syntactical structure of the queries, which already exhibits interesting results on this generalized corpus, we drill deeper into the structural characteristics related to the graph and hypergraph representations of queries. We outline the most common shapes of queries when visually displayed as undirected graphs, and characterize their tree width, the length of their cycles, the maximal degree of their nodes, and more. For queries that cannot be adequately represented as graphs, we investigate their hypergraphs and hypertree width. Moreover, we analyze the evolution of queries over time by introducing the novel concept of a streak, i.e., a sequence of queries that appear as subsequent modifications of a
Document type: Journal articles
Contributor: Angela Bonifati
Submitted on: Friday, January 22, 2021 - 11:14:15 AM
Last modification on: Wednesday, November 3, 2021 - 6:45:49 AM
Long-term archiving on: Friday, April 23, 2021 - 7:01:26 PM


Angela Bonifati, Wim Martens, Thomas Timm. An Analytical Study of Large SPARQL Query Logs. The VLDB Journal, Springer, 2020, 29 (2-3), pp.655-679. ⟨10.1007/s00778-019-00558-9⟩. ⟨hal-03118422⟩


