A fast algorithm for particle simulations, Journal of Computational Physics, vol.73, issue.2, 1987.
A Sparse Matrix Arithmetic Based on $\mathcal{H}$-Matrices. Part I: Introduction to $\mathcal{H}$-Matrices, Computing, vol.62, issue.2, 1999.
DOI : 10.1007/s006070050015
Numerical methods for least squares problems, SIAM, 1996.
Faithful performance prediction of a dynamic task-based runtime system for heterogeneous multi-core architectures, Concurrency and Computation: Practice and Experience, 2015.
DOI : 10.1002/cpe.3555
URL : https://hal.archives-ouvertes.fr/hal-01147997
Implementing Multifrontal Sparse Solvers for Multicore Architectures with Sequential Task Flow Runtime Systems, ACM Transactions on Mathematical Software, vol.43, issue.2, pp.2014-2017, 2014.
DOI : 10.1145/2898348
URL : https://hal.archives-ouvertes.fr/hal-01333645
StarPU: A unified platform for task scheduling on heterogeneous multicore architectures, Concurrency and Computation: Practice and Experience, 2011.
URL : https://hal.archives-ouvertes.fr/inria-00384363
Versatile, scalable, and accurate simulation of distributed applications and platforms, Journal of Parallel and Distributed Computing, vol.74, issue.10, 2014.
DOI : 10.1016/j.jpdc.2014.06.008
URL : https://hal.archives-ouvertes.fr/hal-01017319
Optimizing Compilers for Modern Architectures: A Dependence-Based Approach, 2002.
Parallelizing dense and banded linear algebra libraries using SMPSs, Concurrency and Computation: Practice and Experience, 2009.
DOI : 10.1002/cpe.1463
Fully dynamic scheduler for numerical computing on multicore processors, LAPACK working note, 2009.
Multi-GPU and Multi-CPU Parallelization for Interactive Physics Simulations, Euro-Par, 2010.
DOI : 10.1007/978-3-642-15291-7_23
URL : https://hal.archives-ouvertes.fr/inria-00502448
DAGuE: A generic distributed DAG engine for high performance computing, Parallel Computing, vol.38, issue.1, 2012.
OmpSs: A Proposal for Programming Heterogeneous Multi-Core Architectures, Parallel Processing Letters, vol.21, issue.02, 2011.
DOI : 10.1142/S0129626411000151
CoreTSAR: Adaptive Worksharing for Heterogeneous Systems, Supercomputing - 29th International Conference, 2014.
DOI : 10.1007/978-3-319-07518-1_11
Programming matrix algorithms-by-blocks for thread-level parallelism, ACM Transactions on Mathematical Software, vol.36, issue.3, 2009.
DOI : 10.1145/1527286.1527288
Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects, Journal of Physics: Conference Series, vol.180, issue.1, p.12037, 2009.
DOI : 10.1088/1742-6596/180/1/012037
Dense linear algebra on distributed heterogeneous hardware with a symbolic DAG approach, Scalable Computing and Communications: Theory and Practice, 2013.
Taking Advantage of Hybrid Systems for Sparse Direct Solvers via Task-Based Runtimes, 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014.
DOI : 10.1109/IPDPSW.2014.9
URL : https://hal.archives-ouvertes.fr/hal-00925017
A Parallel Sparse Direct Solver via Hierarchical DAG Scheduling, ACM Transactions on Mathematical Software, vol.41, issue.1, 2014.
DOI : 10.1145/2629641
Fine-Grained Multithreading for the Multifrontal $QR$ Factorization of Sparse Matrices, SIAM Journal on Scientific Computing, vol.35, issue.4, 2013.
DOI : 10.1137/110846427
URL : https://hal.archives-ouvertes.fr/hal-01122471
An overview of SuperLU, ACM Transactions on Mathematical Software, vol.31, issue.3, 2005.
DOI : 10.1145/1089014.1089017
Performance modeling tools for parallel sparse linear algebra computations, in Parallel Computing: From Multicores and GPU's to Petascale, 2009.
The multifrontal solution of indefinite sparse symmetric linear systems, ACM Transactions on Mathematical Software, vol.9, 1983.
A New Implementation of Sparse Gaussian Elimination, ACM Transactions on Mathematical Software, vol.8, issue.3, pp.256-276, 1982.
DOI : 10.1145/356004.356006
Multifrontal QR factorization in a multiprocessor environment, Int. Journal of Num. Linear Alg. and Appl., vol.3, issue.4, 1996.
Algorithm 915, SuiteSparseQR, ACM Transactions on Mathematical Software, vol.38, issue.1, 2011.
DOI : 10.1145/2049662.2049670
Task scheduling for parallel sparse Cholesky factorization, International Journal of Parallel Programming, vol.27, issue.4, 1989.
DOI : 10.1007/BF01407861
Bridging the Gap between Performance and Bounds of Cholesky Factorization on Heterogeneous Platforms, 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015.
DOI : 10.1109/IPDPSW.2015.35
URL : https://hal.archives-ouvertes.fr/hal-01120507
An effective git and orgmode based workflow for reproducible research, SIGOPS Oper. Syst. Rev., vol.49, issue.1, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01112795
Visualizing More Performance Data Than What Fits on Your Screen, Tools for High Performance Computing 2012, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00737651
Scheduling Tree-Shaped Task Graphs to Minimize Memory and Makespan, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing, 2013.
DOI : 10.1109/IPDPS.2013.55
URL : https://hal.archives-ouvertes.fr/hal-00740105
StarPU-MPI: Task programming over clusters of machines enhanced with accelerators, in Recent Advances in the Message Passing Interface, ser. Lecture Notes in Computer Science, J. Träff, S. Benkner, and J. Dongarra, Eds., vol.7490.