
Process Placement in Multicore Clusters: Algorithmic Issues and Practical Techniques

Emmanuel Jeannot 1, 2 Guillaume Mercier 1, 2 François Tessier 2, 1
2 RUNTIME - Efficient runtime systems for parallel architectures
CNRS - Centre National de la Recherche Scientifique : UMR5800, UB - Université de Bordeaux, Inria Bordeaux - Sud-Ouest
Abstract: Current generations of NUMA node clusters feature multicore or manycore processors. Programming such architectures efficiently is a challenge because numerous hardware characteristics have to be taken into account, especially the memory hierarchy. One appealing idea to improve the performance of parallel applications is to decrease their communication costs by matching the communication pattern to the underlying hardware architecture. In this report, we detail the algorithm and techniques proposed to achieve such a result: first, we gather both the communication pattern information and the hardware details. Then we compute a relevant reordering of the various process ranks of the application. Finally, those new ranks are used to reduce the communication costs of the application.
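The three steps named in the abstract (gather the communication pattern and the hardware details, compute a reordering, use the new ranks) can be illustrated with a minimal Python sketch. This is not the algorithm of the report: it is an exhaustive search that is only feasible for a handful of processes, and the function names `placement_cost` and `best_placement`, the cost model (communication volume times core distance), and both matrices are assumptions made up for the illustration.

```python
from itertools import permutations


def placement_cost(comm, dist, placement):
    """Sum of comm[i][j] * dist[placement[i]][placement[j]] over pairs i < j.

    comm[i][j]   : assumed communication volume between processes i and j
    dist[a][b]   : assumed distance (cost per unit of data) between cores a and b
    placement[i] : core on which process i runs
    """
    n = len(comm)
    return sum(
        comm[i][j] * dist[placement[i]][placement[j]]
        for i in range(n)
        for j in range(i + 1, n)
    )


def best_placement(comm, dist):
    """Try every process-to-core permutation and keep the cheapest one.

    Exhaustive search: factorial cost, usable only for very small cases.
    """
    n = len(comm)
    return min(permutations(range(n)), key=lambda p: placement_cost(comm, dist, p))


# Step 1 (made-up data): processes 0<->2 and 1<->3 exchange the most data.
comm = [
    [0, 1, 10, 1],
    [1, 0, 1, 10],
    [10, 1, 0, 1],
    [1, 10, 1, 0],
]
# Cores 0 and 1 share a cache, as do cores 2 and 3; crossing groups costs more.
dist = [
    [0, 1, 4, 4],
    [1, 0, 4, 4],
    [4, 4, 0, 1],
    [4, 4, 1, 0],
]

# Step 2: compute a reordering of the process ranks.
identity = tuple(range(4))
reordered = best_placement(comm, dist)

# Step 3: compare the modelled communication cost before and after.
print("identity placement cost:", placement_cost(comm, dist, identity))
print("reordered placement:", reordered)
print("reordered placement cost:", placement_cost(comm, dist, reordered))
```

In this toy case the reordering puts each heavily communicating pair on cores that share a cache, which lowers the modelled cost. A practical tool would replace the exhaustive search with a heuristic that scales to many processes.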

Cited literature [37 references]

Contributor: Guillaume Mercier
Submitted on: Friday, March 22, 2013 - 11:28:26 AM
Last modification on: Friday, January 21, 2022 - 3:12:55 AM
Long-term archiving on: Monday, June 24, 2013 - 12:10:16 PM




  • HAL Id: hal-00803548, version 1


Emmanuel Jeannot, Guillaume Mercier, François Tessier. Process Placement in Multicore Clusters: Algorithmic Issues and Practical Techniques. [Research Report] RR-8269, INRIA. 2013, pp.32. ⟨hal-00803548⟩


