Optimizing OLAP Cubes Construction by Improving Data Placement on Multi-nodes Clusters

Billel Arres 1, 2 Nadia Kabachi 1, 2, * Omar Boussaid 1, 2, *
* Auteur correspondant
2 SID
ERIC - Equipe de Recherche en Ingénierie des Connaissances
Abstract : The increasing volumes of relational data let us find an alternative to cope with them. The Hadoop framework - which is an open source project based on the MapReduce paradigm - is a popular choice for big data analytics. However, the performance gained from Hadoop’s features is currently limited by its default block placement policy, which does not take any data characteristics into account. Indeed, the efficiency of many operations can be improved by a careful data placement, including indexing, grouping, aggregation and joins. In this paper we propose a data warehouse placement policy to improve query gain performances on multi nodes clusters, especially Hadoop clusters. We investigate the performance gain for OLAP cube construction query with and without data organization. And this, by varying the number of nodes and data warehouse size. It has been found that, the proposed data placement policy has lowered global execution time for building OLAP data cubes up to 20 percent compared to default data placement.
Type de document :
Communication dans un congrès
IEEE. 23rd EuroPDP International Conference on Parallel, Distributed, and Network-Based P, Mar 2015, Turku, Finland. pp.520 - 524, 2015, 〈10.1109/PDP.2015.45〉
Liste complète des métadonnées

https://hal.archives-ouvertes.fr/hal-01166226
Contributeur : Fabien Rico <>
Soumis le : lundi 22 juin 2015 - 13:12:22
Dernière modification le : mercredi 13 janvier 2016 - 10:08:03

Identifiants

Collections

Citation

Billel Arres, Nadia Kabachi, Omar Boussaid. Optimizing OLAP Cubes Construction by Improving Data Placement on Multi-nodes Clusters. IEEE. 23rd EuroPDP International Conference on Parallel, Distributed, and Network-Based P, Mar 2015, Turku, Finland. pp.520 - 524, 2015, 〈10.1109/PDP.2015.45〉. 〈hal-01166226〉

Partager

Métriques

Consultations de la notice

27