, Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods, 1994.
Iterative methods for sparse linear systems, 2003. ,
Using advanced MPI: Modern features of the message-passing interface, 2014. ,
Basic linear algebra subprograms for Fortran usage, ACM TOMS, vol.5, pp.308-323, 1979. ,
A set of level 3 basic linear algebra subprograms, ACM TOMS, vol.16, pp.1-17, 1990. ,
Numerical reproducibility for the parallel reduction on multi-and many-core architectures, ParCo, vol.49, pp.83-97, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-00949355
Parallel Reproducible Summation, IEEE Transactions on Computers, vol.64, pp.2060-2070, 2015. ,
, Hierarchical Approach for Deriving a Reproducible LU factorization, p.1419813, 2019.
URL : https://hal.archives-ouvertes.fr/hal-01382645
Reproducible and Accurate Matrix Multiplication, LNCS, vol.9553, pp.126-137, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01539180
Accuracy and stability of numerical algorithms, 2002. ,
Handbook of Floating-Point Arithmetic, Birkhäuser, 2010. ,
Accurate floating-point summation part ii: Sign, k-fold faithful and rounding to nearest, SIAM J. Sci. Comput, vol.31, pp.1269-1302, 2008. ,
Computer-assisted proofs and self-validating methods, Handbook on Accuracy and Reliability in Scientific Computation, SIAM, pp.195-240, 2005. ,
High-precision computation: applications and challenges, Proceedings of ARITH-21, p.1, 2013. ,
High-precision anchored accumulators for reproducible floating-point summation, Proceedings of ARITH-24, pp.98-105, 2017. ,
High-precision anchored accumulators for reproducible floating-point summation, IEEE Transactions on Computers, 2019. ,
, IEEE Standard for Floating-Point Arithmetic, pp.754-2008, 2008.
Accurate and reproducible blas routines with ozaki scheme for many-core architectures, Proc. International Conference on Parallel Processing and Applied Mathematics (PPAM2019), 2019. ,
The Art of Computer Programming: Seminumerical Algorithms, vol.2, 1969. ,
Accurate sum and dot product, SIAM J. Sci. Comput, vol.26, pp.1955-1988, 2005. ,
The Exact Dot Product As Basic Tool for Long Interval Arithmetic, Computing, vol.91, pp.307-313, 2011. ,
ExBLAS: Reproducible and accurate BLAS library, Proceedings of the NRE2015 workshop held as part of SC15, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01140280
Algorithms for quad-double precision floating point arithmetic, Proceedings of ARITH-15, pp.155-162, 2001. ,
Emulation of a FMA and Correctly Rounded Sums: Proved Algorithms Using Rounding to Odd, IEEE Transactions on Computers, vol.57, pp.462-471, 2008. ,
URL : https://hal.archives-ouvertes.fr/inria-00080427
Algorithms for arbitrary precision floating point arithmetic, 10th IEEE Symposium on Computer Arithmetic, pp.132-143, 1991. ,
, Computer Physics Communications, vol.238, pp.145-156, 2019.
, HPCG -high performance Conjugate Gradients, 2015.
Toward a new metric for ranking high performance computing systems, 2013. ,
HPL -a portable implementation of the high-performance Linpack benchmark for distributedmemory computers, 2008. ,
MPFR: A Multiple-precision Binary Floating-point Library with Correct Rounding, ACM TOMS, vol.33, p.13, 2007. ,
URL : https://hal.archives-ouvertes.fr/inria-00070266
Fast high precision summation, Nonlinear Theory and Its Applications, IEICE, vol.1, pp.2-24, 2010. ,
Fast reproducible floating-point summation, Proceedings of ARITH-21, pp.163-172, 2013. ,
Reproducible tall-skinny QR, Proceedings of ARITH-22, pp.152-159, 2015. ,
Error-free transformations of matrix multiplication by using fast routines of matrix multiplication and its applications, Numerical Algorithms, vol.59, pp.95-118, 2012. ,
Accelerating the solution of linear systems by iterative refinement in three precisions, SIAM J. Sci. Comput, vol.40, pp.817-847, 2018. ,