Task-based multifrontal QR solver for GPU-accelerated multicore architectures

Abstract : Recent studies have shown the potential of task-based programming paradigms for implementing robust, scalable sparse direct solvers for modern computing platforms. Yet, designing task flows that efficiently exploit heterogeneous architectures remains highly challenging. In this paper we first tackle the issue of data partitioning using a method suited for heterogeneous platforms. On the one hand, we design task of sufficiently large granularity to obtain a good acceleration factor on GPU. On the other hand, we limit that size in order to both fit the GPU memory constraints and generate enough parallelism in the task graph. Secondly we handle the task scheduling with a strategy capable of taking into account workload and architecture heterogeneity at a reduced cost. Finally we propose an original evaluation of the performance obtained in our solver on a test set of matrices. We show that the proposed approach allows for processing extremely large input problems on GPU-accelerated platforms and that the overall performance is competitive with equivalent state of the art solvers designed and optimized for GPU-only use.
Type de document :
Rapport
[Research Report] IRI/RT--2015--02--FR-r1, IRIT, Toulouse. 2015
Liste complète des métadonnées

Littérature citée [27 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-01166312
Contributeur : Alfredo Buttari <>
Soumis le : mardi 22 septembre 2015 - 20:56:39
Dernière modification le : samedi 17 septembre 2016 - 01:37:18
Document(s) archivé(s) le : mercredi 26 avril 2017 - 18:34:45

Fichier

IRI-RT--2015--02--FR-r1.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01166312, version 2

Citation

Emmanuel Agullo, Alfredo Buttari, Abdou Guermouche, Florent Lopez. Task-based multifrontal QR solver for GPU-accelerated multicore architectures. [Research Report] IRI/RT--2015--02--FR-r1, IRIT, Toulouse. 2015. 〈hal-01166312v2〉

Partager

Métriques

Consultations de la notice

458

Téléchargements de fichiers

230