Alea -Complex Job Scheduling Simulator - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2019

Alea -Complex Job Scheduling Simulator

Dalibor Klusáček
  • Fonction : Auteur
  • PersonId : 1056514
Mehmet Soysal
  • Fonction : Auteur
  • PersonId : 1056691
Frédéric Suter

Résumé

Using large computer systems such as HPC clusters up to their full potential can be hard. Many problems and inefficiencies relate to the interactions of user workloads and system-level policies. These policies enable various setup choices of the resource management system (RMS) as well as the applied scheduling policy. While expert's assessment and well known best practices do their job when tuning the performance , there is usually plenty of room for further improvements, e.g., by considering more efficient system setups or even radically new scheduling policies. For such potentially damaging modifications it is very suitable to use some form of a simulator first, which allows for repeated evaluations of various setups in a fully controlled manner. This paper presents the latest improvements and advanced simulation capabilities of the Alea job scheduling simulator that has been actively developed for over 10 years now. We present both recently added advanced simulation capabilities as well as a set of real-life based case studies where Alea has been used to evaluate major modifications of real HPC and HTC systems.
Fichier principal
Vignette du fichier
PPAM_2019.pdf (345.62 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02329635 , version 1 (23-10-2019)

Identifiants

  • HAL Id : hal-02329635 , version 1

Citer

Dalibor Klusáček, Mehmet Soysal, Frédéric Suter. Alea -Complex Job Scheduling Simulator. 13th INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING AND APPLIED MATHEMATICS, Sep 2019, Bialystok, Poland. ⟨hal-02329635⟩
102 Consultations
408 Téléchargements

Partager

Gmail Facebook X LinkedIn More