%0 Conference Paper
%F Poster
%T Intentional Data Placement Policy for Improving OLAP Cube Construction on Hadoop Clusters
%+ Equipe de Recherche en Ingénierie des Connaissances (ERIC)
%+ SID
%A Arres, Billel
%A Kabachi, Nadia
%A Boussaid, Omar
%A Bentayeb, Fadila
%< Peer-reviewed
%Z ERIC:14-034
%B 30ème édition de la Conférence Base Données Avancées - BDA 2014
%C Autrans, France
%S Gestion de Données – Principes, Technologies et Applications
%P 33-34
%8 2014-10-14
%D 2014
%K Data warehouse; Data placement; Hadoop; MapReduce
%Z Computer Science [cs]/Databases [cs.DB]
%Z Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC] Conference poster
%X In the recent past, we have witnessed dramatic increases in the volume of data in literally every area: business, science, and daily life, to name a few. The Hadoop framework, an open source project based on the MapReduce paradigm, is a popular choice for big data analytics. However, the performance gained from Hadoop's features is currently limited by its default block placement policy, which does not take any data characteristics into account. Indeed, the efficiency of many operations, including indexing, grouping, aggregation and joins, can be improved by careful data placement. In our work, we propose a data warehouse partitioning strategy to improve query performance. We investigate the performance gain for OLAP cube construction with and without data organization on a Hadoop cluster, varying the number of nodes and the data warehouse size. Our experiments suggest that good data placement on a cluster during the implementation of the data warehouse can significantly improve OLAP cube construction and querying performance. In the next step, we will extend the experiments to study the effects of other configuration parameters on data collocation in the context of parallel data warehousing, such as partition size, replication factor and OLAP query complexity.
We also plan to study an intelligent system for data warehouse placement on clusters by integrating Multi-Agent Systems (MAS) and intelligent agents into the process.
%G English
%L hal-01166222
%U https://hal.science/hal-01166222
%~ UNIV-LYON2
%~ ERIC
%~ BDA
%~ UDL