R. Barnett, D. Payne, R. van de Geijn, and J. Watts, Broadcasting on Meshes with Wormhole Routing, Journal of Parallel and Distributed Computing, vol.35, issue.2, pp.111-122, 1996.
DOI : 10.1006/jpdc.1996.0074

P. Bhat, C. Raghavendra, and V. Prasanna, Efficient collective communication in distributed heterogeneous systems, Journal of Parallel and Distributed Computing, vol.63, issue.3, pp.251-263, 2003.
DOI : 10.1016/S0743-7315(03)00008-X

C. Christara, X. Ding, and K. Jackson, An Efficient Transposition Algorithm for Distributed Memory Computers, Proc. High Performance Computing Systems and Applications, pp.349-368, 2000.
DOI : 10.1007/0-306-47015-2_38

M. Clement, M. Steed, and P. Crandall, Network performance modelling for PVM clusters, Proceedings of Supercomputing, 1996.

D. Culler, R. Karp, D. Patterson, A. Sahay, E. Santos et al., LogP: a practical model of parallel computation, Communications of the ACM, vol.39, issue.11, pp.78-85, 1996.
DOI : 10.1145/240455.240477

D. Grove, Performance Modelling of Message-Passing Parallel Programs, 2003.

N. T. Karonis, I. Foster, B. de Supinski, W. Gropp, E. Lusk et al., A Multilevel Approach to Topology-Aware Collective Operations in Computational Grids, 2002.

T. Kielmann, R. Hofman, H. Bal, A. Plaat, and R. Bhoedjang, MagPIe: MPI's Collective Communication Operations for Clustered Wide Area Systems, Proc. ACM Symposium on Principles and Practice of Parallel Programming, pp.131-140, 1999.

T. Kielmann, H. Bal, and K. Verstoep, Fast Measurement of LogP Parameters for Message Passing Platforms, 4th Workshop on Runtime Systems for Parallel Programming, pp.1176-1183, 2000.
DOI : 10.1007/3-540-45591-4_162

T. Kielmann, H. Bal, S. Gorlatch, K. Verstoep, and R. Hofman, Network performance-aware collective communication for clustered wide-area systems, Parallel Computing, vol.27, issue.11, pp.1431-1456, 2001.
DOI : 10.1016/S0167-8191(01)00098-9

LAM-MPI Team, LAM/MPI Version 7.

LAM-MPI Team, Performance Issues with LAM/MPI on Linux 2.2.x, 2001.

J. Loncaric, Linux TCP Patches to improve acknowledgement policy, 2000.

B. Lowekamp, Discovery and Application of Network Information, 2000.

MPICH Team, MPICH Version 1.2.5, 2003.

D. Skillicorn, J. Hill, and W. McColl, Questions and Answers about BSP, Scientific Programming, pp.249-274, 1997.
DOI : 10.1155/1997/532130

R. Thakur and W. Gropp, Improving the Performance of Collective Operations in MPICH, Proc. of the Euro PVM/MPI 2003, pp.257-267, 2003.
DOI : 10.1007/978-3-540-39924-7_38

S. Vadhiyar, G. Fagg, and J. Dongarra, Automatically Tuned Collective Communications, ACM/IEEE SC 2000 Conference (SC'00), 2000.
DOI : 10.1109/SC.2000.10024

L. G. Valiant, A bridging model for parallel computation, Communications of the ACM, vol.33, issue.8, pp.103-111, 1990.
DOI : 10.1145/79173.79181