Communication in task-parallel ILU-preconditioned CG solvers using MPI+OmpSs. Concurrency and Computation: Practice and Experience, vol.29, p.4280, 2017. ,
High-precision computation: applications and challenges, Proceedings of ARITH-21, p.1, 2013. ,
Iterationfusing conjugate gradient for sparse linear systems with mpi + ompss, Journal of Supercomputing DOI, p.10, 2019. ,
,
Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods, 1994. ,
Highprecision anchored accumulators for reproducible floatingpoint summation, IEEE Transactions on Computers, vol.68, issue.7, pp.967-978, 2019. ,
Accelerating the solution of linear systems by iterative refinement in three precisions, SIAM J. Sci. Comput, vol.40, issue.2, pp.817-847, 2018. ,
Numerical reproducibility for the parallel reduction on multiand many-core architectures, ParCo, vol.49, pp.83-97, 2015. ,
A floating point technique for extending the available precision, Numerische Mathematik, vol.18, issue.3, pp.224-242, 1971. ,
Fast reproducible floating-point summation, Proceedings of ARITH-21, pp.163-172, 2013. ,
Parallel Reproducible Summation, IEEE Transactions on Computers, vol.64, issue.7, pp.2060-2070, 2015. ,
A set of level 3 basic linear algebra subprograms, forum M (2019) MPI forum, vol.16, pp.1-17, 1990. ,
MPFR: A Multiple-precision Binary Floating-point Library with Correct Rounding, ACM TOMS, vol.33, issue.2, p.13, 2007. ,
URL : https://hal.archives-ouvertes.fr/inria-00070266
, Matrix Computations, 2013.
Using advanced MPI: Modern features of the message-passing interface, 2014. ,
Algorithms for quad-double precision floating point arithmetic, Proceedings of ARITH-15, pp.155-162, 2001. ,
Reproducible MPI benchmarking is still not as easy as you think, IEEE Transactions on Parallel and Distributed Systems, vol.27, issue.12, pp.3617-3630, 2016. ,
Reproducibility Strategies for Parallel Preconditioned Conjugate Gradient. JCAM Available online 2nd, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02391618
ExBLAS: Reproducible and accurate BLAS library, Proceedings of the NRE2015 workshop held as part of SC15, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01140280
, ExBLAS (Exact BLAS) library. Available on the WWW, 2017.
Reproducible and Accurate Matrix Multiplication, LNCS, vol.9553, pp.126-137, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01539180
Hierarchical Approach for Deriving a Reproducible Unblocked LU factorization, IJHPCA, vol.33, issue.5, p.1419813, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-01419813
, IEEE Standard for Floating-Point Arithmetic, IEEE Computer Society, pp.754-2008, 2008.
Numerical reproducibility based on minimal-precision validation, the CRE2019 workshop held as part of SC19, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02401878
The Art of Computer Programming: Seminumerical Algorithms, vol.2, 1969. ,
The Exact Dot Product As Basic Tool for Long Interval Arithmetic, Computing, vol.91, issue.3, pp.307-313, 2011. ,
Computer arithmetic and validity, de Gruyter Studies in Mathematics, vol.33, 2013. ,
Basic linear algebra subprograms for Fortran usage, ACM TOMS, vol.5, issue.3, pp.308-323, 1979. ,
High-precision anchored accumulators for reproducible floating-point summation, Proceedings of ARITH-24, pp.98-105, 2017. ,
Accurate and reproducible blas routines with ozaki scheme for many-core architectures, Proc. International Conference on Parallel Processing and Applied Mathematics (PPAM2019), vol.12043, pp.516-527, 2020. ,
Reproducible tall-skinny QR, Proceedings of ARITH-22, pp.152-159, 2015. ,
Accurate sum and dot product, SIAM J. Sci. Comput, vol.26, pp.1955-1988, 2005. ,
The OpenMP API specification for parallel programming, 2019. ,
Error-free transformations of matrix multiplication by using fast routines of matrix multiplication and its applications, Numerical Algorithms, vol.59, issue.1, pp.95-118, 2012. ,
Algorithms for arbitrary precision floating point arithmetic, 10th IEEE Symposium on Computer Arithmetic. IEEE, pp.132-143, 1991. ,
Accurate floating-point summation part i: Faithful rounding, SIAM J. Sci. Comput, vol.31, issue.1, pp.189-224, 2008. ,
Accurate floating-point summation part ii: Sign, k-fold faithful and rounding to nearest, SIAM J. Sci. Comput, vol.31, issue.2, pp.1269-1302, 2008. ,
Fast high precision summation. Nonlinear Theory and Its Applications, IEICE, vol.1, issue.1, pp.2-24, 2010. ,
Iterative methods for sparse linear systems, 2003. ,
Reproducibility, accuracy and performance of the feltor code and library on parallel computer architectures, Computer Physics Communications, vol.238, pp.145-156, 2019. ,
Genaue lösung linearer gleichungssysteme, GAMM Mitt. Ges. Angew. Math. Mech, vol.26, pp.7-107, 2003. ,