A Scheduler-Level Incentive Mechanism for Energy Efficiency in HPC

Abstract : Energy consumption has become one of the most important factors in High Performance Computing platforms. However, while there are various algorithmic and programming techniques to save energy, a user has currently no incentive to employ them, as they might result in worse performance. We propose to manage the energy budget of a supercomputer through EnergyFairShare (EFS), a FairShare-like scheduling algorithm. FairShare is a classic scheduling rule that prioritizes jobs belonging to users who were assigned small amount of CPU-second in the past. Similarly, EFS keeps track of users 'consumption of Watt-seconds and prioritizes those whom jobs consumed less energy. Therefore, EFS incentives users to optimize their code for energy efficiency. Having higher priority, jobs have smaller queuing times and, thus, smaller turn-around time. To validate this principle, we implemented EFS in a scheduling simulator and processed workloads from various HPC centers. The results show that, by reducing it energy consumption, auser will reduce it stretch (slowdown), compared to increasing it energy consumption. To validate the general feasibility odour approach, we also implemented EFS as an extension forSLURM, a popular HPC resource and job management system.We validated our plugin both by emulating a large scale platform, and by experiments upon a real cluster with monitored energy consumption. We observed smaller waiting times for energy efficient users.
Complete list of metadatas

Cited literature [25 references]  Display  Hide  Download

Contributor : Grégory Mounié <>
Submitted on : Thursday, December 3, 2015 - 2:42:23 PM
Last modification on : Thursday, April 4, 2019 - 10:18:05 AM
Long-term archiving on : Saturday, April 29, 2017 - 10:17:01 AM


Files produced by the author(s)







Yiannis Georgiou, David Glesser, Krzysztof Rzadca, Denis Trystram. A Scheduler-Level Incentive Mechanism for Energy Efficiency in HPC. CCGrid 2015 - 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, May 2015, Shenzhen, China. pp.617--626, ⟨10.1109/CCGrid.2015.101⟩. ⟨hal-01230295⟩



Record views


Files downloads