From Simulation to Experiment: A Case Study on Multiprocessor Task Scheduling

Abstract : Simulation is a popular approach for empirically evaluating the performance of algorithms and applications in the parallel computing domain. Most published works present results without quantifying simulation error. In this work we investigate accuracy issues when simulating the execution of parallel applications. This is a broad question, and we focus on a relevant case study: the evaluation of scheduling algorithms for executing mixed-parallel applications on clusters. Most such scheduling algorithms have been evaluated in simulation only. We compare simulations to real-world experiments in a view to identify which features of a simulator are most critical for simulation accuracy. Our first finding is that simple yet popular analytical simulation models lead to simulation results that cannot be used for soundly comparing scheduling algorithms. We then show that, by contrast, simulation models instantiated based on brute-force measurements of the target execution environment lead to usable results. Finally, we develop empirical simulation models that provide a reasonable compromise between the two previous approaches.
Type de document :
Communication dans un congrès
13th Workshop on Advances in Parallel and Distributed Computational Models (APDCM), May 2011, Anchorage, Alaska, United States. pp.665-672, 2011
Liste complète des métadonnées

Littérature citée [19 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-00627842
Contributeur : Frederic Suter <>
Soumis le : jeudi 29 septembre 2011 - 16:35:46
Dernière modification le : jeudi 29 septembre 2011 - 16:41:09
Document(s) archivé(s) le : mardi 13 novembre 2012 - 14:51:00

Fichier

sim2real.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00627842, version 1

Collections

IN2P3 | INRIA | LIG | UGA

Citation

Sascha Hunold, Henri Casanova, Frederic Suter. From Simulation to Experiment: A Case Study on Multiprocessor Task Scheduling. 13th Workshop on Advances in Parallel and Distributed Computational Models (APDCM), May 2011, Anchorage, Alaska, United States. pp.665-672, 2011. 〈hal-00627842〉

Partager

Métriques

Consultations de
la notice

312

Téléchargements du document

143