Skip to Main content Skip to Navigation

Task-based multifrontal QR solver for GPU-accelerated multicore architectures

Abstract : Recent studies have shown the potential of task-based programming paradigms for implementing robust, scalable sparse direct solvers for modern computing platforms. Yet, designing task flows that efficiently exploit heterogeneous architectures remains highly challenging. In this paper we first tackle the issue of data partitioning using a method suited for heterogeneous platforms. On the one hand, we design task of sufficiently large granularity to obtain a good acceleration factor on GPU. On the other hand, we limit that size in order to both fit the GPU memory constraints and generate enough parallelism in the task graph. Secondly we handle the task scheduling with a strategy capable of taking into account workload and architecture heterogeneity at a reduced cost. Finally we propose an original evaluation of the performance obtained in our solver on a test set of matrices. We show that the proposed approach allows for processing extremely large input problems on GPU-accelerated platforms and that the overall performance is competitive with equivalent state of the art solvers designed and optimized for GPU-only use.
Complete list of metadata

Cited literature [27 references]  Display  Hide  Download
Contributor : Alfredo Buttari Connect in order to contact the contributor
Submitted on : Tuesday, September 22, 2015 - 8:56:39 PM
Last modification on : Saturday, December 4, 2021 - 3:06:15 AM
Long-term archiving on: : Wednesday, April 26, 2017 - 6:34:45 PM


Files produced by the author(s)


  • HAL Id : hal-01166312, version 2


Emmanuel Agullo, Alfredo Buttari, Abdou Guermouche, Florent Lopez. Task-based multifrontal QR solver for GPU-accelerated multicore architectures. [Research Report] IRI/RT--2015--02--FR-r1, IRIT, Toulouse. 2015. ⟨hal-01166312v2⟩



Record views


Files downloads