J. I. Aliaga, M. Barreda, G. Flegar, M. Bollhöfer, and E. S. Quintana-Ortí, Communication in task-parallel ILU-preconditioned CG solvers using MPI+OmpSs, Concurrency and Computation: Practice and Experience, vol.29, p.4280, 2017.

D. H. Bailey, High-precision computation: applications and challenges, Proceedings of ARITH-21, p.1, 2013.

M. Barreda, J. Aliaga, V. Beltran, and M. Casas, Iteration-fusing conjugate gradient for sparse linear systems with MPI + OmpSs, Journal of Supercomputing, 2019.

R. Barrett, M. Berry, T. F. Chan, J. Demmel, J. Donato et al., Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods, 1994.

N. Burgess, C. Goodyer, D. R. Lutz, and C. N. Hinds, High-precision anchored accumulators for reproducible floating-point summation, IEEE Transactions on Computers, vol.68, issue.7, pp.967-978, 2019.

E. Carson and N. J. Higham, Accelerating the solution of linear systems by iterative refinement in three precisions, SIAM J. Sci. Comput, vol.40, issue.2, pp.817-847, 2018.

S. Collange, D. Defour, S. Graillat, and R. Iakymchuk, Numerical reproducibility for the parallel reduction on multi- and many-core architectures, Parallel Computing, vol.49, pp.83-97, 2015.

T. J. Dekker, A floating point technique for extending the available precision, Numerische Mathematik, vol.18, issue.3, pp.224-242, 1971.

J. Demmel and H. D. Nguyen, Fast reproducible floating-point summation, Proceedings of ARITH-21, pp.163-172, 2013.

J. Demmel and H. D. Nguyen, Parallel Reproducible Summation, IEEE Transactions on Computers, vol.64, issue.7, pp.2060-2070, 2015.

J. J. Dongarra, J. Du Croz, S. Hammarling, and I. Duff, A set of level 3 basic linear algebra subprograms, ACM TOMS, vol.16, issue.1, pp.1-17, 1990.

MPI Forum, MPI forum, 2019.

L. Fousse, G. Hanrot, V. Lefèvre, P. Pélissier, and P. Zimmermann, MPFR: A Multiple-precision Binary Floating-point Library with Correct Rounding, ACM TOMS, vol.33, issue.2, p.13, 2007.
URL : https://hal.archives-ouvertes.fr/inria-00070266

G. H. Golub and C. F. Van Loan, Matrix Computations, 2013.

W. Gropp, T. Hoefler, R. Thakur, and E. Lusk, Using advanced MPI: Modern features of the message-passing interface, 2014.

Y. Hida, X. S. Li, and D. H. Bailey, Algorithms for quad-double precision floating point arithmetic, Proceedings of ARITH-15, pp.155-162, 2001.

S. Hunold and A. Carpen-Amarie, Reproducible MPI benchmarking is still not as easy as you think, IEEE Transactions on Parallel and Distributed Systems, vol.27, issue.12, pp.3617-3630, 2016.

R. Iakymchuk, M. Barreda, M. Wiesenberger, J. I. Aliaga, and E. S. Quintana-Ortí, Reproducibility Strategies for Parallel Preconditioned Conjugate Gradient, JCAM, available online, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02391618

R. Iakymchuk, S. Collange, D. Defour, and S. Graillat, ExBLAS: Reproducible and accurate BLAS library, Proceedings of the NRE2015 workshop held as part of SC15, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01140280

R. Iakymchuk, S. Collange, D. Defour, and S. Graillat, ExBLAS (Exact BLAS) library. Available on the WWW, 2017.

R. Iakymchuk, D. Defour, S. Collange, and S. Graillat, Reproducible and Accurate Matrix Multiplication, LNCS, vol.9553, pp.126-137, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01539180

R. Iakymchuk, S. Graillat, D. Defour, and E. S. Quintana-Ortí, Hierarchical Approach for Deriving a Reproducible Unblocked LU factorization, IJHPCA, vol.33, issue.5, 2019.
URL : https://hal.archives-ouvertes.fr/hal-01419813

IEEE Computer Society, IEEE Standard for Floating-Point Arithmetic, IEEE Std 754-2008, 2008.

T. Imamura, D. Mukunoki, R. Iakymchuk, F. Jézéquel, and S. Graillat, Numerical reproducibility based on minimal-precision validation, the CRE2019 workshop held as part of SC19, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02401878

D. E. Knuth, The Art of Computer Programming: Seminumerical Algorithms, vol.2, 1969.

U. Kulisch and V. Snyder, The Exact Dot Product As Basic Tool for Long Interval Arithmetic, Computing, vol.91, issue.3, pp.307-313, 2011.

U. W. Kulisch, Computer arithmetic and validity, de Gruyter Studies in Mathematics, vol.33, 2013.

C. L. Lawson, R. J. Hanson, D. R. Kincaid, and F. T. Krogh, Basic linear algebra subprograms for Fortran usage, ACM TOMS, vol.5, issue.3, pp.308-323, 1979.

D. R. Lutz and C. N. Hinds, High-precision anchored accumulators for reproducible floating-point summation, Proceedings of ARITH-24, pp.98-105, 2017.

D. Mukunoki, T. Ogita, and K. Ozaki, Accurate and reproducible BLAS routines with Ozaki scheme for many-core architectures, Proc. International Conference on Parallel Processing and Applied Mathematics (PPAM2019), vol.12043, pp.516-527, 2020.

H. D. Nguyen and J. Demmel, Reproducible tall-skinny QR, Proceedings of ARITH-22, pp.152-159, 2015.

T. Ogita, S. M. Rump, and S. Oishi, Accurate sum and dot product, SIAM J. Sci. Comput, vol.26, pp.1955-1988, 2005.

OpenMP Architecture Review Board, The OpenMP API specification for parallel programming, 2019.

K. Ozaki, T. Ogita, S. Oishi, and S. M. Rump, Error-free transformations of matrix multiplication by using fast routines of matrix multiplication and its applications, Numerical Algorithms, vol.59, issue.1, pp.95-118, 2012.

D. M. Priest, Algorithms for arbitrary precision floating point arithmetic, Proceedings of the 10th IEEE Symposium on Computer Arithmetic, IEEE, pp.132-143, 1991.

S. M. Rump, T. Ogita, and S. Oishi, Accurate floating-point summation part I: Faithful rounding, SIAM J. Sci. Comput, vol.31, issue.1, pp.189-224, 2008.

S. M. Rump, T. Ogita, and S. Oishi, Accurate floating-point summation part II: Sign, k-fold faithful and rounding to nearest, SIAM J. Sci. Comput, vol.31, issue.2, pp.1269-1302, 2008.

S. M. Rump, T. Ogita, and S. Oishi, Fast high precision summation, Nonlinear Theory and Its Applications, IEICE, vol.1, issue.1, pp.2-24, 2010.

Y. Saad, Iterative methods for sparse linear systems, 2003.

M. Wiesenberger, L. Einkemmer, M. Held, A. Gutierrez-Milla, X. Saez et al., Reproducibility, accuracy and performance of the Feltor code and library on parallel computer architectures, Computer Physics Communications, vol.238, pp.145-156, 2019.

G. Zielke and V. Drygalla, Genaue Lösung linearer Gleichungssysteme, GAMM Mitt. Ges. Angew. Math. Mech, vol.26, pp.7-107, 2003.