Resource aggregation for task-based Cholesky Factorization on top of heterogeneous machines

Terry Cojean 1, 2 Abdou Guermouche 2, 3, 4 Andra Hugo 5 Raymond Namyst 1, 2, 4 Pierre-André Wacrenier 1, 2, 4
1 STORM - STatic Optimizations, Runtime Methods
LaBRI - Laboratoire Bordelais de Recherche en Informatique, Inria Bordeaux - Sud-Ouest
3 HiePACS - High-End Parallel Algorithms for Challenging Numerical Simulations
LaBRI - Laboratoire Bordelais de Recherche en Informatique, Inria Bordeaux - Sud-Ouest
Abstract : Hybrid computing platforms are now commonplace, featuring a large number of CPU cores and accelerators. This trend makes balancing computations between these heterogeneous resources performance critical. In this paper we propose aggregating several CPU cores in order to execute larger parallel tasks and thus improve the load balance between CPUs and accelerators. Additionally, we present our approach to exploit internal parallelism within tasks. This is done by combining two runtime systems: one runtime system to handle the task graph and another one to manage the internal parallelism. We demonstrate the relevance of our approach in the context of the dense Cholesky factorization kernel implemented on top of the StarPU task-based runtime system. We present experimental results showing that our solution outperforms state of the art implementations.
Complete list of metadatas

Cited literature [17 references]  Display  Hide  Download

https://hal.inria.fr/hal-01181135
Contributor : Abdou Guermouche <>
Submitted on : Tuesday, August 23, 2016 - 3:55:13 PM
Last modification on : Monday, May 13, 2019 - 2:19:57 PM
Long-term archiving on : Thursday, November 24, 2016 - 1:11:13 PM

File

papier (1).pdf
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

  • HAL Id : hal-01181135, version 3

Citation

Terry Cojean, Abdou Guermouche, Andra Hugo, Raymond Namyst, Pierre-André Wacrenier. Resource aggregation for task-based Cholesky Factorization on top of heterogeneous machines. HeteroPar'2016 worshop of Euro-Par, Aug 2016, Grenoble, France. ⟨hal-01181135v3⟩

Share

Metrics

Record views

557

Files downloads

631