PAXQuery: A Massively Parallel XQuery Processor

Jesús Camacho-Rodríguez 1, 2 Dario Colazzo 3 Ioana Manolescu 1, 2
2 OAK - Database optimizations and architectures for complex large data
CNRS - Centre National de la Recherche Scientifique : UMR8623, Inria Saclay - Ile de France, UP11 - Université Paris-Sud - Paris 11, LRI - Laboratoire de Recherche en Informatique
Abstract : We present a novel approach for parallelizing the execution of queries over XML documents, implemented within our system PAXQuery. We compile a rich subset of XQuery into plans expressed in the PArallelization ConTracts (PACT) programming model. These plans are then optimized and executed in parallel by the Stratosphere system. We demonstrate the efficiency and scalability of our approach through experiments on hundreds of GB of XML data.
Document type :
Conference papers
Liste complète des métadonnées

Cited literature [17 references]  Display  Hide  Download
Contributor : Jesús Camacho-Rodríguez <>
Submitted on : Tuesday, November 25, 2014 - 11:19:23 AM
Last modification on : Monday, May 28, 2018 - 2:38:02 PM
Document(s) archivé(s) le : Thursday, February 26, 2015 - 10:15:37 AM


Publisher files allowed on an open archive




Jesús Camacho-Rodríguez, Dario Colazzo, Ioana Manolescu. PAXQuery: A Massively Parallel XQuery Processor. DanaC’14, Jun 2014, Snowbird, UT, United States. ⟨10.1145/2627770.2627772⟩. ⟨hal-01086808⟩



Record views


Files downloads