Designing bitreproducible portable high-performance applications, Proceedings of IPDPS'14, pp.1235-1244, 2014. ,
DOI : 10.1109/ipdps.2014.127
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.644.7828
Numerical reproducibility for the parallel reduction on multi- and many-core architectures, Parallel Computing, vol.49, pp.83-97, 2015. ,
DOI : 10.1016/j.parco.2015.09.001
Fast Reproducible Floating-Point Summation, 2013 IEEE 21st Symposium on Computer Arithmetic, pp.163-172, 2013. ,
DOI : 10.1109/ARITH.2013.9
Parallel Reproducible Summation, IEEE Transactions on Computers, vol.64, issue.7, pp.2060-2070, 2015. ,
DOI : 10.1109/TC.2014.2345391
Top Ten ExaScale Research Challenges, 2014. ,
Reproducible and Accurate Matrix Multiplication for GPU Accelerators, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01102877
Reproducible Triangular Solvers for High-Performance Computing, 2015 12th International Conference on Information Technology, New Generations, pp.353-358, 2015. ,
DOI : 10.1109/ITNG.2015.63
URL : https://hal.archives-ouvertes.fr/hal-01116588
The exact dot product as basic tool for long interval arithmetic, Computing, vol.205, issue.3, pp.307-313, 2011. ,
DOI : 10.1007/s00607-010-0127-7
Design, implementation and testing of extended and mixed precision BLAS, ACM Trans. Math. Softw, vol.28, issue.2, pp.152-205, 2002. ,
Fast exact summation using small and large superaccumulators, 2015. ,