Skip to Main content Skip to Navigation
Conference papers

Adaptive Optimal Control of MapReduce Performance, Availability and Costs

Abstract : MapReduce is a popular programming model for distributed data processing and Big Data applications running on clouds. Extensive research has been conducted either to improve the dependability or to increase performance of MapReduce, ranging from adaptive and on-demand fault-tolerance solutions, adaptive task scheduling techniques to optimized job execution mechanisms. This paper investigates an optimization-based solution to control MapReduce systems in order to provide guarantees in terms of both performance and availability while reducing utilization costs. We follow a control theoretical approach for MapReduce cluster scaling and admission control. Moreover, we aim to be robust to changes in MapRe-duce and in it's environment by adapting the controller online to those changes. This paper highlights the major challenges of combining system adaptation and optimal control to take the best of both approaches. CCS Concepts • Networks → Cloud computing; • Software and its engineering → Software configuration management and version control systems; • Computer systems organization → Dependable and fault-tolerant systems and networks;
Complete list of metadata

Cited literature [19 references]  Display  Hide  Download
Contributor : Nicolas Marchand Connect in order to contact the contributor
Submitted on : Monday, June 13, 2016 - 12:36:28 PM
Last modification on : Saturday, June 25, 2022 - 10:21:40 AM


Files produced by the author(s)


  • HAL Id : hal-01331024, version 1


Sophie Cerf, Mihaly Berekmeri, Bogdan Robu, Nicolas Marchand, Sara Bouchenak. Adaptive Optimal Control of MapReduce Performance, Availability and Costs. Feedback Computin 2016 - 11th International Workshop on Feedback Computing, Jul 2016, Wurzburg, Germany. ⟨hal-01331024⟩



Record views


Files downloads