HAQWA: a Hash-based and Query Workload Aware Distributed RDF Store - Archive ouverte HAL Accéder directement au contenu
Poster De Conférence Année : 2015

HAQWA: a Hash-based and Query Workload Aware Distributed RDF Store

Olivier Curé
Hubert Naacke
Mohamed-Amine Baazizi
Bernd Amann

Résumé

Like most data models encountered in the Big Data ecosystem, RDF stores are managing large data sets by partitioning triples across a cluster of machines. Nevertheless, the graphical nature of RDF data as well as its associated SPARQL query execution model makes the efficient data distribution more involved than in other data models, e.g., relational. In this paper, we propose a novel system that is characterized by a trade-off between complexity of data partitioning and efficiency of query answering in cases where a query workload is known. The prototype is implemented over the Apache Spark framework, ensuring high availability, fault tolerance and scalability. This short paper presents the main features of the system and highlights the omnipresence of parallel computation across data fragmentation and allocation, encoding and query processing tasks.
Fichier non déposé

Dates et versions

hal-01214900 , version 1 (13-10-2015)

Identifiants

  • HAL Id : hal-01214900 , version 1

Citer

Olivier Curé, Hubert Naacke, Mohamed-Amine Baazizi, Bernd Amann. HAQWA: a Hash-based and Query Workload Aware Distributed RDF Store. The 14th International Semantic Web Conference, ISWC 2015, Oct 2015, Bethlehem, Pennsylvania, United States. CEUR-WS.org, 1486, CEUR Workshop Proceedings. ⟨hal-01214900⟩
337 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More