PAXQuery: Parallel Analytical XML Processing - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2015

PAXQuery: Parallel Analytical XML Processing

Résumé

XQuery is a general-purpose programming language for processing semi-structured data, and as such, it is very expressive. As a consequence , optimizing and parallelizing complex analytics XQuery queries is still an open, challenging problem. We demonstrate PAXQuery, a novel system that parallelizes the execution of XQuery queries over large collections of XML documents. PAXQuery compiles a rich subset of XQuery into plans expressed in the PArallelization ConTracts (PACT) programming model. Thanks to this translation, the resulting plans are optimized and executed in a massively parallel fashion by the Apache Flink system. The result is a scalable system capable of querying massive amounts of XML data very efficiently, as proved by the experimental results we outline.
Fichier principal
Vignette du fichier
PAXQuery-SIGMOD2015.pdf (879.81 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

hal-01178490 , version 1 (20-07-2015)

Identifiants

Citer

Jesús Camacho-Rodríguez, Dario Colazzo, Ioana Manolescu, Juan A. M. Naranjo. PAXQuery: Parallel Analytical XML Processing. ACM SIGMOD International Conference on Management of Data 2015, May 2015, Melbourne, Victoria, Australia. pp.1117-1122, ⟨10.1145/2723372.2735374⟩. ⟨hal-01178490⟩
451 Consultations
259 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More