Task-based multifrontal QR solver for GPU-accelerated multicore architectures

Abstract: Recent studies have shown the potential of task-based programming paradigms for implementing robust, scalable sparse direct solvers for modern computing platforms. Yet designing task flows that efficiently exploit heterogeneous architectures remains highly challenging. In this paper we first tackle the issue of data partitioning using a method suited to heterogeneous platforms. On the one hand, we design tasks of sufficiently large granularity to obtain a good acceleration factor on the GPU. On the other hand, we limit that size in order both to fit the GPU memory constraints and to generate enough parallelism in the task graph. Second, we handle task scheduling with a strategy capable of taking into account workload and architecture heterogeneity at a reduced cost. Finally, we propose an original evaluation of the performance obtained by our solver on a test set of matrices. We show that the proposed approach allows for processing extremely large input problems on GPU-accelerated platforms and that the overall performance is competitive with equivalent state-of-the-art solvers designed and optimized for GPU-only use.

Cited literature: 27 references

https://hal.archives-ouvertes.fr/hal-01166312
Contributor: Alfredo Buttari
Submitted on: Tuesday, September 22, 2015 - 8:56:39 PM
Last modification on: Friday, April 12, 2019 - 4:23:33 PM
Document(s) archived on: Wednesday, April 26, 2017 - 6:34:45 PM

File

IRI-RT--2015--02--FR-r1.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: hal-01166312, version 2

Citation

Emmanuel Agullo, Alfredo Buttari, Abdou Guermouche, Florent Lopez. Task-based multifrontal QR solver for GPU-accelerated multicore architectures. [Research Report] IRI/RT--2015--02--FR-r1, IRIT, Toulouse. 2015. ⟨hal-01166312v2⟩

Metrics

Record views: 562
File downloads: 315