Batched Cholesky Factorization for tiny matrices

Florian Lemaitre; Lionel Lacassagne

Communication Dans Un Congrès Année : 2016

Batched Cholesky Factorization for tiny matrices

(1, 2) , (1)

1
2

Florian Lemaitre

Fonction : Auteur
PersonId : 14384
IdHAL : florian-lemaitre
ORCID : 0000-0002-3362-4229
IdRef : 24947039X

Architecture et Logiciels pour Systèmes Embarqués sur Puce

European Organization for Nuclear Research

Lionel Lacassagne

Fonction : Auteur
PersonId : 1009
IdHAL : llacassagne
ORCID : 0000-0002-1056-9458
IdRef : 140256849

Architecture et Logiciels pour Systèmes Embarqués sur Puce

Résumé

Many linear algebra libraries, such as the Intel MKL, Magma or Eigen, provide fast Cholesky factorization. These libraries are suited for big matrices but perform slowly on small ones. Even though State-of-the-Art studies begin to take an interest in small matrices, they usually feature a few hundreds rows. Fields like Computer Vision or High Energy Physics use tiny matrices. In this paper we show that it is possible to speedup the Cholesky factorization for tiny matrices by grouping them in batches and using highly specialized code. We provide High Level Transformations that accelerate the factorization for current Intel SIMD architectures (SSE, AVX2, KNC, AVX512). We achieve with these transformations combined with SIMD a speedup from 13 to 31 for the whole resolution compared to the naive code on a single core AVX2 machine and a speedup from 15 to 33 with multithreading compared to the multithreaded naive code.

Mots clés

SIMD tiny matrices batched factorization cholesky linear algebra SSE multi-thread AVX

Domaines

Algorithme et structure de données [cs.DS] Architectures Matérielles [cs.AR] Automatique / Robotique Arithmétique des ordinateurs Calcul parallèle, distribué et partagé [cs.DC] Génie logiciel [cs.SE] Traitement des images [eess.IV]

Fichier principal

dasip_2016_final_draft.pdf (587.76 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Lionel Lacassagne : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01361204

Soumis le : mardi 6 septembre 2016-17:53:28

Dernière modification le : mardi 11 avril 2023-15:16:28

Archivage à long terme le : mercredi 7 décembre 2016-13:25:31

Dates et versions

hal-01361204 , version 1 (06-09-2016)

Identifiants

HAL Id : hal-01361204 , version 1

Citer

Florian Lemaitre, Lionel Lacassagne. Batched Cholesky Factorization for tiny matrices. Design and Architectures for Signal and Image Processing (DASIP), ECSI, Oct 2016, Rennes, France. pp.1--8. ⟨hal-01361204⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UPMC CNRS LIP6 TDS-MACS SORBONNE-UNIVERSITE SU-SCIENCES

232 Consultations

644 Téléchargements

Batched Cholesky Factorization for tiny matrices

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager