A class of parallel tiled linear algebra algorithms for multicore architectures

Alfredo Buttari; Julien Langou; Jakub Kurzak; Jack J. Dongarra

doi:10.1016/j.parco.2008.10.002

Article Dans Une Revue Parallel Computing Année : 2009

A class of parallel tiled linear algebra algorithms for multicore architectures

(1, 2) , (3) , (4) , (4)

1
2
3
4

Alfredo Buttari

Fonction : Auteur
PersonId : 170442
IdHAL : alfredo-buttari
ORCID : 0000-0003-3207-7021
IdRef : 167548999

Algorithmes Parallèles et Optimisation

Centre National de la Recherche Scientifique

Julien Langou

Fonction : Auteur

Department of Mathematical and Statistical Sciences

Jakub Kurzak

Fonction : Auteur

Innovative Computing Laboratory [Knoxville]

Jack J. Dongarra

Fonction : Auteur

Innovative Computing Laboratory [Knoxville]

Résumé

As multicore systems continue to gain ground in the high performance computing world, linear algebra algorithms have to be reformulated or new algorithms have to be developed in order to take advantage of the architectural features on these new processors. Fine grain parallelism becomes a major requirement and introduces the necessity of loose synchronization in the parallel execution of an operation. This paper presents algorithms for the Cholesky, LU and QR factorization where the operations can be represented as a sequence of small tasks that operate on square blocks of data. These tasks can be dynamically scheduled for execution based on the dependencies among them and on the availability of computational resources. This may result in out of order execution of tasks which will completely hide the presence of intrinsically sequential tasks in the factorization. Performance comparisons are presented with LAPACK algorithms where parallelism can only be exploited at the level of the BLAS operations and vendor implementations.

Domaines

Calcul parallèle, distribué et partagé [cs.DC] Algorithme et structure de données [cs.DS] Logiciel mathématique [cs.MS]

Alfredo Buttari : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02420965

Soumis le : vendredi 20 décembre 2019-11:05:20

Dernière modification le : lundi 20 novembre 2023-11:44:23

Dates et versions

hal-02420965 , version 1 (20-12-2019)

Identifiants

HAL Id : hal-02420965 , version 1
DOI : 10.1016/j.parco.2008.10.002

Citer

Alfredo Buttari, Julien Langou, Jakub Kurzak, Jack J. Dongarra. A class of parallel tiled linear algebra algorithms for multicore architectures. Parallel Computing, 2009, 35 (1), pp.38-53. ⟨10.1016/j.parco.2008.10.002⟩. ⟨hal-02420965⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-TLSE2 CNRS UT1-CAPITOLE IRIT IRIT-APO IRIT-CISO IRIT-CNRS TOULOUSE-INP UNIV-UT3 UT3-TOULOUSEINP

59 Consultations

0 Téléchargements

A class of parallel tiled linear algebra algorithms for multicore architectures

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager