A. Alexandrov, M. Ionescu, K. Schauser, and C. Scheiman, LogGP, Proceedings of the seventh annual ACM symposium on Parallel algorithms and architectures, SPAA '95, 1995.
DOI : 10.1145/215399.215427

O. Beaumont, V. Boudet, F. Rastello, and Y. Robert, Matrix multiplication on heterogeneous platforms, IEEE Transactions on Parallel and Distributed Systems, vol.12, issue.10, pp.1033-1051, 2001.
DOI : 10.1109/71.963416
URL : https://hal.archives-ouvertes.fr/hal-00808288

P. Bhat, V. K. Prasanna, and C. Raghavendra, Adaptive communication algorithms for distributed heterogeneous systems, Proceedings of the IEEE International Symposium on High Performance Distributed Computing, 1998.

F. Cappello, P. Fraigniaud, B. Mans, and A. Rosenberg, An algorithmic model for heterogeneous hyper-clusters: rationale and experience, International Journal of Foundations of Computer Science, vol.16, issue.02, pp.195-215, 2005.
DOI : 10.1142/S0129054105002942

Z. Chen, J. Dongarra, P. Luszczek, and K. Roche, Self-adapting software for numerical linear algebra and LAPACK for clusters, Parallel Computing, vol.29, issue.11-12, pp.1723-1743, 2003.
DOI : 10.1016/j.parco.2003.05.014

J. Cuenca, D. Giménez, J. González, J. Dongarra, and K. Roche, Automatic optimisation of parallel linear algebra routines in systems with variable load, Proceedings of the Eleventh Euromicro Conference on Parallel, Distributed and Network-Based Processing, 2003.
DOI : 10.1109/EMPDP.2003.1183618

D. Culler, R. Karp, D. Patterson, A. Sahay, K. E. Schauser et al., LogP: a practical model of parallel computation, Communications of the ACM, vol.39, issue.11, pp.78-85, 1996.
DOI : 10.1145/240455.240477

F. Desprez and F. Suter, Impact of Mixed-Parallelism on Parallel Implementations of Strassen and Winograd Matrix Multiplication Algorithms, Concurrency and Computation: Practice and Experience, pp.771-797, 2004.

L. Eyraud-Dubois, A. Legrand, M. Quinson, and F. Vivien, A first step towards automatically building network representations, Euro-Par, pp.160-169, 2007.

J. Faik, J. D. Teresco, K. D. Devine, J. E. Flaherty, and L. G. Gervasio, A model for resource-aware load balancing on heterogeneous clusters, 2005.

M. I. Frank, A. Agarwal, and M. K. Vernon, LoPC: Modeling contention in parallel algorithms, Proc. of 6th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), 1997.

M. Frigo and S. G. Johnson, FFTW: an adaptive software architecture for the fast Fourier transform, Proceedings of ICASSP, 1998.

O. Hartmann, M. Kuhnemann, T. Rauber, and G. Runger, Adaptive selection of communication methods to optimize collective MPI operations, Proc. of the 12th Workshop on Compilers for Parallel Computers (CPC'06), 2006.

R. Hockney, The communication challenge for MPP: Intel Paragon and Meiko CS-2, Parallel Computing, vol.20, issue.3, pp.389-398, 1994.
DOI : 10.1016/S0167-8191(06)80021-9

B. Hong and V. K. Prasanna, Adaptive matrix multiplication in heterogeneous environments, Proceedings of the Ninth International Conference on Parallel and Distributed Systems, 2002.
DOI : 10.1109/ICPADS.2002.1183389

S. Hunold, T. Rauber, and G. Runger, Multilevel hierarchical matrix multiplication on clusters, Proceedings of the 18th annual international conference on Supercomputing, ICS '04, 2004.
DOI : 10.1145/1006209.1006230

T. Kielmann, H. Bal, S. Gorlatch, K. Verstoep, and R. Hofman, Network performance-aware collective communication for clustered wide-area systems, Parallel Computing, vol.27, issue.11, pp.1431-1456, 2001.
DOI : 10.1016/S0167-8191(01)00098-9

A. L. Lastovetsky, I. Mkwawa, and M. O'Flynn, An accurate communication model of a heterogeneous cluster based on a switch-enabled Ethernet network, 12th International Conference on Parallel and Distributed Systems (ICPADS'06), 2006.
DOI : 10.1109/ICPADS.2006.24

B. Lowekamp and A. Beguelin, ECO: Efficient Collective Operations for communication on heterogeneous networks, Proceedings of International Conference on Parallel Processing, 1996.
DOI : 10.1109/IPPS.1996.508087

M. O. McCracken, A. Snavely, and A. Malony, Performance modeling for dynamic algorithm selection, Intl. Conference on Computational Science, 2003.

C. A. Moritz and M. I. Frank, LoGPC: Modeling network contention in message-passing programs, IEEE Transactions on Parallel and Distributed Systems, vol.12, issue.4, pp.404-415, 2001.
DOI : 10.1109/71.920589

W. Nasri, D. Trystram, and S. Achour, Adaptive algorithms for the parallelization of the dense matrix multiplication on clusters, Internat. J. of Computational Science and Engineering, Special Issue on best, 2007.

Y. Ohtaki, D. Takahashi, T. Boku, and M. Sato, Parallel implementation of Strassen's matrix multiplication algorithm for heterogeneous clusters, Proceedings of the 18th International Parallel and Distributed Processing Symposium, 2004.
DOI : 10.1109/IPDPS.2004.1303066

J. Pjesivac-Grbovic, T. Angskun, G. Bosilca, G. E. Fagg, E. Gabriel et al., Performance Analysis of MPI Collective Operations, 19th IEEE International Parallel and Distributed Processing Symposium, 2005.
DOI : 10.1109/IPDPS.2005.335

J. Pjesivac-Grbovic, G. Bosilca, G. E. Fagg, T. Angskun, and J. J. Dongarra, Decision Trees and MPI Collective Algorithm Selection Problem, Proceedings of Euro-Par 2007, pp.107-117, 2007.
DOI : 10.1007/978-3-540-74466-5_13

L. A. Steffenel and G. Mounié, Identifying logical homogeneous clusters for efficient wide-area communication, Proc. of the Euro PVM/MPI 2004, pp.319-326, 2004.

L. A. Steffenel and G. Mounié, Scheduling heuristics for efficient broadcast operations on grid environments, Proc. of the Performance Modeling, Evaluation and Optimization of Parallel and Distributed Systems Workshop -PMEO'06, 2006.
URL : https://hal.archives-ouvertes.fr/hal-00022008

L. A. Steffenel, Modeling network contention effects on alltoall operations, Proc. of the IEEE Conference on Cluster Computing, 2006.
URL : https://hal.archives-ouvertes.fr/hal-00089242

J. D. Teresco, J. Faik, and J. E. Flaherty, Resource-Aware Scientific Computation on a Heterogeneous Cluster, Computing in Science and Engineering, vol.7, issue.2, pp.40-50, 2005.
DOI : 10.1109/MCSE.2005.38

N. Thomas, G. Tanase, O. Tkachyshyn, J. Perdue, N. M. Amato et al., A framework for adaptive algorithm selection in STAPL, Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of parallel programming, PPoPP '05, 2005.
DOI : 10.1145/1065944.1065981

S. Vadhiyar, G. Fagg, and J. Dongarra, Towards an accurate model for collective communications, International Journal of High Performance Computing Applications, vol.8, issue.1, pp.159-167, 2004.

R. C. Whaley, A. Petitet, and J. J. Dongarra, Automated empirical optimizations of software and the ATLAS project, Parallel Computing, vol.27, issue.1-2, pp.3-35, 2001.
DOI : 10.1016/S0167-8191(00)00087-9

R. Wolski, N. Spring, and C. Peterson, Implementing a performance forecasting system for metacomputing, Proceedings of the 1997 ACM/IEEE conference on Supercomputing (CDROM), Supercomputing '97, 1997.
DOI : 10.1145/509593.509600