Reproducible and Accurate Matrix Multiplication for High-Performance Computing

Caroline Collange; David Defour; Stef Graillat; Roman Iakymchuk

Communication Dans Un Congrès Année : 2014

Reproducible and Accurate Matrix Multiplication for High-Performance Computing

(1) , (2) , (3) , (3)

1
2
3

Caroline Collange

Fonction : Auteur
PersonId : 177452
IdHAL : caroline-collange
IdRef : 151116776

Amdahl's Law is Forever

David Defour

Fonction : Auteur
PersonId : 4651
IdHAL : david-defour
ORCID : 0000-0001-9923-2394
IdRef : 104542454

Digits, Architectures et Logiciels Informatiques

Stef Graillat

Fonction : Auteur
PersonId : 5653
IdHAL : stef-graillat
IdRef : 104060735

Performance et Qualité des Algorithmes Numériques

Roman Iakymchuk

Fonction : Auteur
PersonId : 966
IdHAL : roman-iakymchuk
IdRef : 253135079

Performance et Qualité des Algorithmes Numériques

Résumé

On modern multi-core, many-core, and heterogeneous architectures, floating-point computations may become non-deterministic and thus non-reproducible mainly due to non-associativity of floating-point operations. We introduce an algorithm to compute a product of two floating-point matrices that delivers reproducible results with the best possible accuracy. Our multi-level algorithm relies on fast vectorized floating-point expansions and as well as superaccumulators in a high-radix carry-save representation. We present implementations on recent Intel Xeon Phi accelerators and both AMD and NVIDIA GPUs.

Mots clés

multi-precision multi- and many-core architectures. Matrix multiplication reproducibility accuracy long accumulator

Domaines

Informatique [cs]

Fichier principal

scan14-1.pdf (192.74 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Lip6 Publications : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01215627

Soumis le : mercredi 23 novembre 2016-18:25:34

Dernière modification le : mardi 11 avril 2023-15:16:28

Archivage à long terme le : mardi 21 mars 2017-05:31:19

Dates et versions

hal-01215627 , version 1 (23-11-2016)

Identifiants

HAL Id : hal-01215627 , version 1

Citer

Caroline Collange, David Defour, Stef Graillat, Roman Iakymchuk. Reproducible and Accurate Matrix Multiplication for High-Performance Computing. SCAN: Scientific Computing, Computer Arithmetic and Validated Numerics, Sep 2014, Wuerzburg, Germany. pp.42-43. ⟨hal-01215627⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSTITUT-TELECOM UPMC EC-PARIS UNIV-RENNES1 CNRS INRIA UNIV-PERP INSA-RENNES IRISA LIP6 DALI LIRMM IRISA-D3 INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC MIPS UNIV-MONTPELLIER UNIV-RENNES SORBONNE-UNIVERSITE SU-SCIENCES UR1-MATH-NUM

366 Consultations

89 Téléchargements

Reproducible and Accurate Matrix Multiplication for High-Performance Computing

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager