
HAQWA: a Hash-based and Query Workload Aware Distributed RDF Store

Abstract : Like most data stores encountered in the Big Data ecosystem, RDF stores manage large data sets by partitioning their data, namely triples, across a cluster of machines. However, the graph nature of RDF data and its associated SPARQL query execution model make efficient data distribution more involved than for other data models, e.g., the relational one. In this paper, we propose a novel system characterized by a trade-off between the complexity of data partitioning and the efficiency of query answering when a query workload is known. The prototype is implemented on top of the Apache Spark framework, which provides high availability, fault tolerance and scalability. This short paper presents the main features of the system and highlights the pervasive use of parallel computation across the data fragmentation and allocation, encoding, and query processing tasks.
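
As an illustration of what hash-based partitioning of triples can look like, the following is a minimal sketch and not HAQWA's actual implementation. It assumes that each triple is placed by hashing its subject, a choice the abstract does not specify. The function name, the use of CRC32 as the hash, and the sample `ex:` triples are all hypothetical.

```python
import zlib
from collections import defaultdict


def partition_by_subject(triples, num_partitions):
    """Group (subject, predicate, object) triples by a hash of the subject.

    Assumption: subject-based placement. This is an illustrative choice,
    not a description of the system presented in the paper.
    """
    partitions = defaultdict(list)
    for s, p, o in triples:
        # zlib.crc32 is deterministic across runs, unlike Python's
        # built-in hash() on strings, which is salted per process.
        pid = zlib.crc32(s.encode("utf-8")) % num_partitions
        partitions[pid].append((s, p, o))
    return dict(partitions)


# Hypothetical sample data.
triples = [
    ("ex:alice", "ex:knows", "ex:bob"),
    ("ex:alice", "ex:worksAt", "ex:lip6"),
    ("ex:bob", "ex:knows", "ex:carol"),
]
parts = partition_by_subject(triples, num_partitions=4)
```

Under this assumption, all triples sharing a subject land in the same partition, so a query whose patterns all share one subject can be answered without moving data between machines.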
Document type : Poster communications
Contributor : Lip6 Publications
Submitted on : Tuesday, October 13, 2015 - 11:57:18 AM
Last modification on : Friday, September 16, 2022 - 1:56:06 PM


  • HAL Id : hal-01214900, version 1


Olivier Curé, Hubert Naacke, Mohamed-Amine Baazizi, Bernd Amann. HAQWA: a Hash-based and Query Workload Aware Distributed RDF Store. The 14th International Semantic Web Conference, ISWC 2015, Oct 2015, Bethlehem, Pennsylvania, United States. CEUR Workshop Proceedings, vol. 1486. ⟨hal-01214900⟩


