Adaptive Resource and Job Management for Limited Power Consumption

Abstract : The last decades have been characterized by an ever growing requirement in terms of computing and storage resources. This tendency has recently put the pressure on the ability to efficiently manage the power required to operate the huge amount of electrical components associated with state-of-the-art high performance computing systems. The power consumption of a supercomputer needs to be adjusted based on varying power budget or electricity availabilities. As a consequence, Resource and Job Management Systems have to be adequately adapted in order to efficiently schedule jobs with optimized performance while limiting power usage whenever needed. We introduce in this paper a new scheduling strategy that can adapt the executed workload to a limited power budget. The originality of this approach relies upon a combination of speed scaling and node shutdown techniques for power reductions. It is implemented into the widely used resource and job management system SLURM. Finally, it is validated through large scale emulations using real production workload traces of the supercomputer Curie.
Complete list of metadatas

Cited literature [30 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01230292
Contributor : Grégory Mounié <>
Submitted on : Thursday, December 3, 2015 - 2:46:13 PM
Last modification on : Thursday, April 4, 2019 - 10:18:05 AM
Long-term archiving on : Friday, March 4, 2016 - 1:00:14 PM

File

SENT_ipdps.pdf.pdf
Files produced by the author(s)

Licence


Copyright

Identifiers

Collections

UGA | INRIA | LIG

Citation

Yiannis Georgiou, David Glesser, Denis Trystram. Adaptive Resource and Job Management for Limited Power Consumption. IEEE International Parallel and Distributed Processing Symposium Workshop, IPDPS 2015, Hyderabad, India, May 25-29, 2015, Hyderabad, India. pp.863--870, ⟨10.1109/IPDPSW.2015.118⟩. ⟨hal-01230292⟩

Share

Metrics

Record views

762

Files downloads

261