Pufferbench: Evaluating and Optimizing Malleability of Distributed Storage - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

Pufferbench: Evaluating and Optimizing Malleability of Distributed Storage

Résumé

Malleability is the property of an application to be dynamically rescaled at run time. It requires the possibility to dynamically add or remove resources to the infrastructure without interruption. Yet, many Big Data applications cannot benefit from their inherent malleability, since their colocated distributed storage system is not malleable in practice. Commissioning or decommissioning storage nodes is generally assumed to be slow, as such operations have typically been designed for maintenance only. New technologies, however, enable faster data transfers. Still, evaluating the performance of rescaling operations on a given platform is a challenge in itself: no tool currently exists for this purpose. We introduce Pufferbench, a benchmark for evaluating how fast one can scale up and down a distributed storage system on a given infrastructure and, thereby, how viably can one implement storage malleability on it. Besides, it can serve to quickly prototype and evaluate mechanisms for malleability in existing distributed storage systems. We validate Pufferbench against theoretical lower bounds for commission and decommis-sion: it can achieve performance within 16% of them. We use Pufferbench to evaluate in practice these operations in HDFS: commission in HDFS could be accelerated by as much as 14 times! Our results show that: (1) the lower bounds for commission and decommission times we previously established are sound and can be approached in practice; (2) HDFS could handle these operations much more efficiently; most importantly, (3) malleability in distributed storage systems is viable and should be further leveraged for Big Data applications.
Fichier principal
Vignette du fichier
Paper.pdf (174.1 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01892713 , version 1 (10-10-2018)

Identifiants

Citer

Nathanaël Cheriere, Matthieu Dorier, Gabriel Antoniu. Pufferbench: Evaluating and Optimizing Malleability of Distributed Storage. PDSW-DISCS 2018: 3rd Joint International workshop on Parallel Data Storage and Data Intensive Scalable computing Systems, Nov 2018, Dallas, United States. pp.1-10, ⟨10.1109/PDSW-DISCS.2018.00006⟩. ⟨hal-01892713⟩
224 Consultations
235 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More