J. I. Aliaga, J. Perez, and E. S. Quintana-orti, Systematic Fusion of CUDA Kernels for Iterative Sparse Linear System Solvers, Euro-Par 2015: Parallel Processing -21st International Conference on Parallel and Distributed Computing Proceedings, pp.675-686, 2015.
DOI : 10.1007/978-3-662-48096-0_52

J. I. Aliaga, J. Perez, E. S. Quintana-orti, and H. Anzt, Reformulated Conjugate Gradient for the Energy-Aware Solution of Linear Systems on GPUs, 2013 42nd International Conference on Parallel Processing, pp.320-329, 2013.
DOI : 10.1109/ICPP.2013.41

H. Anzt, E. Chow, and J. Dongarra, Iterative Sparse Triangular Solves for Preconditioning, Euro-Par 2015: Parallel Processing, pp.650-661, 2015.
DOI : 10.1007/978-3-662-48096-0_50

H. Anzt, S. Tomov, and J. Dongarra, Energy efficiency and performance frontiers for sparse computations on GPU supercomputers, Proceedings of the Sixth International Workshop on Programming Models and Applications for Multicores and Manycores, PMAM '15, pp.1-10, 2015.
DOI : 10.1177/1094342010385729

F. Archambeau, N. Méchitoua, and M. Sakiz, Code Saturne: A Finite Volume Code for the computation of turbulent incompressible flows -Industrial Applications, International Journal on Finite Volumes, vol.1, issue.1, 2004.
URL : https://hal.archives-ouvertes.fr/hal-01115371

E. Chow, H. Anzt, and J. Dongarra, Asynchronous Iterative Algorithm for Computing Incomplete Factorizations on GPUs, In Lecture Notes in Computer Science, vol.9137, pp.1-16, 2015.
DOI : 10.1007/978-3-319-20119-1_1

E. Chow and A. Patel, Fine-Grained Parallel Incomplete LU Factorization, SIAM Journal on Scientific Computing, vol.37, issue.2, pp.169-193, 2015.
DOI : 10.1137/140968896

K. Rupp, F. Rudolf, and J. Weinbub, ViennaCL -A High Level Linear Algebra Library for GPUs and Multi-Core CPUs, Intl. Workshop on GPUs and Scientific Applications, pp.51-56, 2010.

Y. Saad, Iterative Methods for Sparse Linear Systems, Society for Industrial and Applied Mathematics, 2003.
DOI : 10.1137/1.9780898718003