Design methodology for workload-aware loop scheduling strategies based on genetic algorithm and simulation

Abstract : In high-performance computing, an application's workload must be evenly balanced among threads to achieve high performance and scalability. In OpenMP, the load-balancing problem arises when loop iterations are scheduled to threads. Several scheduling strategies have been proposed for this purpose, but they do not take the application's input workload into account and thus turn out to be suboptimal. In this work, we introduce a design methodology to propose, study, and assess the performance of workload-aware loop scheduling strategies. In this methodology, a genetic algorithm explores the solution space of the scheduling problem and guides the design of new loop scheduling strategies, and a simulator evaluates their performance. As a proof of concept, we show how the methodology was used to propose and study a new workload-aware loop scheduling strategy named smart round-robin (SRR). We implemented this strategy in the OpenMP runtime of the GNU Compiler Collection and carried out several experiments to validate the simulator and to evaluate the performance of SRR. Our experimental results show that SRR may deliver up to 37.89% and 14.10% better performance than OpenMP's dynamic loop scheduling strategy in the simulated environment and in a real-world application kernel, respectively.
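
To make the load-balancing problem concrete, the following toy sketch simulates how long a parallel loop takes under two workload-oblivious policies when iteration costs are skewed. It is an illustration only: it is not the simulator used in the paper and it does not implement SRR, whose details are not given in this record. The function names, the cost model (iteration i costs i time units), and the absence of scheduling overhead are all assumptions made for the example.

```python
import heapq


def static_makespan(costs, n_threads):
    """Makespan when each thread gets one contiguous, equal-size block
    of iterations (in the spirit of OpenMP's schedule(static))."""
    chunk = -(-len(costs) // n_threads)  # ceiling division
    loads = [sum(costs[t * chunk:(t + 1) * chunk]) for t in range(n_threads)]
    return max(loads)


def dynamic_makespan(costs, n_threads, chunk=1):
    """Makespan when each chunk goes to whichever thread becomes idle
    first (in the spirit of schedule(dynamic)); overhead is ignored."""
    finish = [0.0] * n_threads  # time at which each thread becomes idle
    heapq.heapify(finish)
    for start in range(0, len(costs), chunk):
        idle_at = heapq.heappop(finish)  # earliest-idle thread
        heapq.heappush(finish, idle_at + sum(costs[start:start + chunk]))
    return max(finish)


# Hypothetical skewed workload: 16 iterations, iteration i costs i units.
costs = [float(i) for i in range(1, 17)]
print("static :", static_makespan(costs, 4))
print("dynamic:", dynamic_makespan(costs, 4))
print("ideal  :", sum(costs) / 4)
```

In this sketch both policies finish later than the ideal bound of total work divided by thread count, which is the gap a strategy that knows the iteration costs in advance could aim to close.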

https://hal.archives-ouvertes.fr/hal-01354028
Contributor : Jean-François Méhaut
Submitted on : Monday, January 23, 2017 - 2:55:05 PM
Last modification on : Monday, July 8, 2019 - 3:11:44 PM
Long-term archiving on : Monday, April 24, 2017 - 2:34:21 PM

File

main.pdf
Files produced by the author(s)

Citation

Pedro Henrique Penna, Márcio Castro, Henrique Cota de Freitas, François Broquedis, Jean-François Méhaut. Design methodology for workload-aware loop scheduling strategies based on genetic algorithm and simulation. Concurrency and Computation: Practice and Experience, Wiley, 2017, 29 (22), ⟨10.1002/cpe.3933⟩. ⟨hal-01354028⟩
