Pipelined parallelism for multi-join queries on shared nothing machines

Mostafa Bamha; Matthieu Exbrayat

Communication Dans Un Congrès Année : 2003

Pipelined parallelism for multi-join queries on shared nothing machines

(1) , (1)

Mostafa Bamha

Fonction : Auteur
PersonId : 834021

Laboratoire d'Informatique Fondamentale d'Orléans

Matthieu Exbrayat

Fonction : Auteur
PersonId : 18139
IdHAL : matthieu-exbrayat
ORCID : 0000-0002-1740-4752
IdRef : 102424314

Laboratoire d'Informatique Fondamentale d'Orléans

Résumé

The development of scalable parallel database systems requires the design of efficient algorithms especially for the join which is the most frequent and expensive operation in relational database systems. Join is also the most vulnerable operation to data skew and to the high cost of communication in distributed architectures. Moreover, for multi-join queries, the problem of data-skew is more complicated because the imbalance of intermediate results is unknown during static query optimization. In this paper, we show that the join algorithms we presented in our earlier papers, can be applied efficiently in various parallel execution strategies making it possible to exploit not only intra-operator parallelism but also inter-operator parallelism. These algorithms reduce the communication and synchronization costs to a minimum while guaranteeing a perfect load balancing during all the stages of join computation even for highly skewed data.

Mots clés

PDBMS : Parallel Database Management Systems Intra-transaction parallelism Parallel joins Multi-joins Data skew Join-product skew Dynamic load balancing.

Dynamic load balancing

Domaines

Informatique

Mostafa Bamha : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00081353

Soumis le : jeudi 22 juin 2006-18:38:47

Dernière modification le : samedi 25 juin 2022-10:10:45

Dates et versions

hal-00081353 , version 1 (22-06-2006)

Identifiants

HAL Id : hal-00081353 , version 1

Citer

Mostafa Bamha, Matthieu Exbrayat. Pipelined parallelism for multi-join queries on shared nothing machines. (ParCo 2003), 2003, France. ⟨hal-00081353⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-ORLEANS MSL MSL-THESE

38 Consultations

0 Téléchargements

Pipelined parallelism for multi-join queries on shared nothing machines

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager