Conference paper, Year: 2010

A Data Generator for Cloud-Scale Benchmarking

Tilmann Rabl, Michael Frank, Hatem Mousselly-Sergieh, Harald Kosch

Abstract

In many fields of research and business, data sizes are breaking the petabyte barrier. This poses new problems and opens new research opportunities for the database community. Data of this size is usually stored in large clusters or clouds. Although clouds have become very popular in recent years, there has been little work on benchmarking cloud applications. In this paper we present a data generator for cloud-scale applications. Its architecture makes the data generator easy to extend and configure. A key feature is its high degree of parallelism, which allows linear scaling across arbitrary numbers of nodes. We show how distributions, relationships, and dependencies in data can be computed in parallel with linear speedup.
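The abstract does not say how relationships are computed in parallel, so the following is only a minimal sketch of one way this could work, not the authors' implementation. It assumes that every cell value is derived deterministically from its coordinates (a master seed, table, column, and row), so any node can recompute a referenced value without talking to other nodes. The function names, the `orders` and `customers` tables, and the hashing scheme are all hypothetical.

```python
import hashlib


def cell_seed(master_seed: int, table: str, column: str, row: int) -> int:
    """Derive a repeatable 64-bit number for one cell from its coordinates.

    Assumption: hashing the coordinates stands in for a seeded random
    number generator; the abstract does not specify the mechanism.
    """
    key = f"{master_seed}/{table}/{column}/{row}".encode()
    return int.from_bytes(hashlib.sha256(key).digest()[:8], "big")


def customer_id(row: int) -> int:
    """Primary key of the hypothetical customers table (1-based)."""
    return row + 1


def order_row(master_seed: int, row: int, num_customers: int) -> tuple:
    """Build one row of the hypothetical orders table.

    The foreign key is found by picking a customer row from the cell seed
    and recomputing that customer's key locally, with no communication.
    """
    ref_row = cell_seed(master_seed, "orders", "customer_ref", row) % num_customers
    amount = (cell_seed(master_seed, "orders", "amount", row) % 10_000) / 100
    return (row + 1, customer_id(ref_row), amount)


def generate_partition(master_seed: int, node: int, num_nodes: int,
                       num_orders: int, num_customers: int) -> list:
    """Generate the contiguous slice of order rows assigned to one node."""
    start = node * num_orders // num_nodes
    end = (node + 1) * num_orders // num_nodes
    return [order_row(master_seed, r, num_customers) for r in range(start, end)]
```

Because each row depends only on its own coordinates, concatenating the partitions produced by `num_nodes` independent workers should give the same table as a single worker generating every row, which is the property that makes linear scaling plausible under these assumptions.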

Dates and versions

hal-01381621, version 1 (14-10-2016)

Cite

Tilmann Rabl, Michael Frank, Hatem Mousselly-Sergieh, Harald Kosch. A Data Generator for Cloud-Scale Benchmarking. The Second TPC Technology Conference on Performance Evaluation, Measurement and Characterization of Complex Systems, Sep 2010, Singapore, Singapore. pp.41-56, ⟨10.1007/978-3-642-18206-8_4⟩. ⟨hal-01381621⟩