Skip to Main content Skip to Navigation
Poster communications

HAQWA: a Hash-based and Query Workload Aware Distributed RDF Store

Abstract : Like most data models encountered in the Big Data ecosystem, RDF stores are managing large data sets by partitioning triples across a cluster of machines. Nevertheless, the graphical nature of RDF data as well as its associated SPARQL query execution model makes the efficient data distribution more involved than in other data models, e.g., relational. In this paper, we propose a novel system that is characterized by a trade-off between complexity of data partitioning and efficiency of query answering in cases where a query workload is known. The prototype is implemented over the Apache Spark framework, ensuring high availability, fault tolerance and scalability. This short paper presents the main features of the system and highlights the omnipresence of parallel computation across data fragmentation and allocation, encoding and query processing tasks.
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-01214900
Contributor : Lip6 Publications <>
Submitted on : Tuesday, October 13, 2015 - 11:57:18 AM
Last modification on : Wednesday, February 26, 2020 - 7:06:07 PM

Identifiers

  • HAL Id : hal-01214900, version 1

Citation

Olivier Curé, Hubert Naacke, Mohamed-Amine Baazizi, Bernd Amann. HAQWA: a Hash-based and Query Workload Aware Distributed RDF Store. The 14th International Semantic Web Conference, ISWC 2015, Oct 2015, Bethlehem, Pennsylvania, United States. CEUR-WS.org, 1486, CEUR Workshop Proceedings. ⟨hal-01214900⟩

Share

Metrics

Record views

321