Conference papers

ExBLAS: Reproducible and Accurate BLAS Library

Roman Iakymchuk (1, 2), Sylvain Collange (3), David Defour (4), Stef Graillat (1)
1 PEQUAN - Performance et Qualité des Algorithmes Numériques
LIP6 - Laboratoire d'Informatique de Paris 6
3 ALF - Amdahl's Law is Forever
Inria Rennes – Bretagne Atlantique, IRISA-D3 - ARCHITECTURE
4 DALI - Digits, Architectures et Logiciels Informatiques
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, UPVD - Université de Perpignan Via Domitia
Abstract : Because floating-point operations are non-associative and parallel architectures schedule work dynamically, obtaining a bit-wise reproducible floating-point result across multiple executions of the same code, on different or even similar parallel architectures, is challenging. We address the problem of reproducibility in the context of fundamental linear algebra operations, such as those included in the BLAS library, and propose algorithms that yield results that are both reproducible and accurate (correctly rounded, except for the triangular solver). We present implementations of these algorithms for the BLAS routines, along with performance results in parallel environments such as Intel desktop and server CPUs, Intel Xeon Phi, and both NVIDIA and AMD GPUs.
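The non-associativity mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the ExBLAS algorithm: it uses the standard-library function `math.fsum`, which returns a correctly rounded sum, purely as a stand-in to show why a correctly rounded result is independent of the order of operations. The input values and the helper names `naive_sum` and `distinct_results` are hypothetical and chosen only for illustration.

```python
import itertools
import math


def naive_sum(values):
    """Left-to-right summation; every += rounds to the nearest double."""
    total = 0.0
    for v in values:
        total += v
    return total


def distinct_results(values, summation):
    """Return the set of results `summation` gives over every ordering of `values`."""
    return {summation(order) for order in itertools.permutations(values)}


# Hypothetical input chosen so that rounding depends on the order of operations.
data = [1e16, 1.0, -1e16, 1.0]

# Different orderings stand in for different parallel schedules.
naive = distinct_results(data, naive_sum)
exact = distinct_results(data, math.fsum)

if __name__ == "__main__":
    print("naive results:", sorted(naive))
    print("fsum results: ", sorted(exact))
```

Here the naive loop produces more than one distinct value depending on the ordering, whereas the correctly rounded sum is the same (2.0) for every ordering. A parallel reduction with dynamic scheduling effectively picks one of these orderings at run time, which is why reproducibility requires more than a fixed source code.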

https://hal.archives-ouvertes.fr/hal-01202396
Contributor : Roman Iakymchuk
Submitted on : Monday, December 21, 2015 - 10:45:26 PM
Last modification on : Friday, July 10, 2020 - 4:25:40 PM
Document(s) archived on : Saturday, April 29, 2017 - 11:31:09 PM

File

exblas.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01202396, version 3

Citation

Roman Iakymchuk, Sylvain Collange, David Defour, Stef Graillat. ExBLAS: Reproducible and Accurate BLAS Library. NRE: Numerical Reproducibility at Exascale, Nov 2015, Austin, TX, United States. ⟨hal-01202396v3⟩
