Parallel Data Loading during Querying Deep Web and Linked Open Data with SPARQL

Abstract : Web integration systems are able to provide transparent and uniform access to heterogeneous Web data sources by integrating views of Linked Data, Web Service results, or data extracted from the Deep Web. However, given the potential large number of views, query engines of Web integration systems have to implement execution techniques able to scale up to real-world scenarios and efficiently execute queries. We tackle the problem of SPARQL query processing against RDF views, and propose a non-blocking query execution strategy that incrementally accesses and merges the views relevant to a SPARQL query in a parallel fashion. The proposed strategy is implemented on top of Jena 2.7.4, and empirically compared with SemLAV, a sequential SPARQL query engine on RDF views. Results suggest that our approach outperforms SemLAV in terms of the number of answers produced per unit of time.
Document type :
Conference papers
Complete list of metadatas

Cited literature [17 references]  Display  Hide  Download
Contributor : Pauline Folz <>
Submitted on : Tuesday, January 26, 2016 - 4:38:43 PM
Last modification on : Thursday, July 26, 2018 - 9:05:51 AM
Long-term archiving on : Wednesday, April 27, 2016 - 1:15:39 PM


Files produced by the author(s)


  • HAL Id : hal-01261825, version 1



Pauline Folz, Gabriela Montoya, Hala Skaf-Molli, Pascal Molli, Maria-Esther Vidal. Parallel Data Loading during Querying Deep Web and Linked Open Data with SPARQL. 11th International Workshop on Scalable Semantic Web Knowledge Base Systems (SSWS2015) at ISWC, Oct 2015, Bethlehem, United States. ⟨hal-01261825⟩



Record views


Files downloads