HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation
Conference papers

Reproducible and Accurate Matrix Multiplication for High-Performance Computing

Caroline Collange 1 David Defour 2 Stef Graillat 3 Roman Iakymchuk 3
1 ALF - Amdahl's Law is Forever
Inria Rennes – Bretagne Atlantique , IRISA-D3 - ARCHITECTURE
2 DALI - Digits, Architectures et Logiciels Informatiques
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, UPVD - Université de Perpignan Via Domitia
3 PEQUAN - Performance et Qualité des Algorithmes Numériques
LIP6 - Laboratoire d'Informatique de Paris 6
Abstract : On modern multi-core, many-core, and heterogeneous architectures, floating-point computations may become non-deterministic and thus non-reproducible mainly due to non-associativity of floating-point operations. We introduce an algorithm to compute a product of two floating-point matrices that delivers reproducible results with the best possible accuracy. Our multi-level algorithm relies on fast vectorized floating-point expansions and as well as superaccumulators in a high-radix carry-save representation. We present implementations on recent Intel Xeon Phi accelerators and both AMD and NVIDIA GPUs.
Document type :
Conference papers
Complete list of metadata

Cited literature [2 references]  Display  Hide  Download

Contributor : Lip6 Publications Connect in order to contact the contributor
Submitted on : Wednesday, November 23, 2016 - 6:25:34 PM
Last modification on : Thursday, December 16, 2021 - 2:04:02 PM
Long-term archiving on: : Tuesday, March 21, 2017 - 5:31:19 AM


Files produced by the author(s)


  • HAL Id : hal-01215627, version 1


Caroline Collange, David Defour, Stef Graillat, Roman Iakymchuk. Reproducible and Accurate Matrix Multiplication for High-Performance Computing. SCAN: Scientific Computing, Computer Arithmetic and Validated Numerics, Sep 2014, Wuerzburg, Germany. pp.42-43. ⟨hal-01215627⟩



Record views


Files downloads