Progress Thread Placement for Overlapping MPI Non-Blocking Collectives using Simultaneous Multi-Threading - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

Progress Thread Placement for Overlapping MPI Non-Blocking Collectives using Simultaneous Multi-Threading

Résumé

Non-blocking collectives have been proposed so as to allow communications to be overlapped with computation in order to amor-tize the cost of MPI collective operations. To obtain a good overlap ratio, communications and computation have to run in parallel. To achieve this, different hardware and software techniques exists. Dedicated some cores to run progress threads is one of them. However, some CPUs provide Simultaneous Multi-Threading, which is the ability for a core to have multiple hardware threads running simultaneously, sharing the same arithmetic units. Our idea is to use them to run progress threads to avoid dedicated cores allocation. We have run benchmarks on Haswell processors , using its Hyper-Threading capability, and get good results for both performance and overlap only when inter-node communications are used by MPI processes. However, we also show that enabling Simultaneous Multi-Threading for intra-communications leads to bad performances due to cache effects.
Fichier principal
Vignette du fichier
smt2018.pdf (248.32 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01888257 , version 1 (04-10-2018)

Identifiants

  • HAL Id : hal-01888257 , version 1

Citer

Alexandre Denis, Julien Jaeger, Hugo Taboada. Progress Thread Placement for Overlapping MPI Non-Blocking Collectives using Simultaneous Multi-Threading. COLOC : 2nd workshop on data locality, in conjuction with EURO-PAR 2018, Aug 2018, Turin, Italy. ⟨hal-01888257⟩
128 Consultations
584 Téléchargements

Partager

Gmail Facebook X LinkedIn More