Q. Ali, V. S. Pai, and S. P. Midkiff, Advanced collective communication in aspen, Proceedings of the 22nd annual international conference on Supercomputing , ICS '08, pp.83-93, 2008.
DOI : 10.1145/1375527.1375543

A. Bar-noy, J. Bruck, C. Ho, S. Kipnis, and B. Schieber, Computing global combine operations in the multiport postal model, IEEE Transactions on Parallel and Distributed Systems, vol.6, issue.8, pp.896-900, 1995.
DOI : 10.1109/71.406965

A. Bar-noy, S. Kipnis, and B. Schieber, AN OPTIMAL ALGORITHM FOR COMPUTING CENSUS FUNCTIONS IN MESSAGE-PASSING SYSTEMS, Parallel Processing Letters, vol.03, issue.01, pp.19-23, 1993.
DOI : 10.1142/S0129626493000046

J. Bruck and C. Ho, EFFICIENT GLOBAL COMBINE OPERATIONS IN MULTI-PORT MESSAGE-PASSING SYSTEMS, Parallel Processing Letters, vol.03, issue.04, pp.335-346, 1993.
DOI : 10.1142/S012962649300037X

E. W. Chan, M. F. Heimlich, A. Purkayastha, and R. A. Van-de-geijn, Collective communication: theory, practice, and experience, Concurrency and Computation: Practice and Experience, vol.49, issue.13, pp.1749-1783, 2007.
DOI : 10.1002/cpe.1206

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.187.9821

T. Hoefler, A. Lumsdaine, and J. Dongarra, Towards Efficient MapReduce Using MPI, Matti Ropo Recent Advances in Parallel Virtual Inria Machine and Message Passing Interface, pp.240-249, 2009.
DOI : 10.1007/978-3-642-03770-2_30

S. , L. Johnsson, and C. Ho, Optimum broadcasting and personalized communication in hypercubes, IEEE Transactions on Computers, vol.38, issue.9, pp.1249-1268, 1989.

T. Kielmann, R. F. Hofman, H. E. Bal, A. Plaat, and R. A. Bhoedjang, MPI's reduction operations in clustered wide area systems, 1999.

E. Donald and . Knuth, The art of computer programming) sorting and searching, 1998.

A. Legrand, L. Marchal, and Y. Robert, Optimizing the steady-state throughput of scatter and reduce operations on heterogeneous platforms, Journal of Parallel and Distributed Computing, vol.65, issue.12, pp.1497-1514, 2005.
DOI : 10.1016/j.jpdc.2005.05.021

URL : https://hal.archives-ouvertes.fr/hal-00789425

P. Liu, M. Kuo, and D. Wang, An Approximation Algorithm and Dynamic Programming for Reduction in Heterogeneous Environments, Algorithmica, vol.33, issue.4, pp.425-453, 2009.
DOI : 10.1007/s00453-007-9113-7

J. Pjesivac-grbovic, T. Angskun, G. Bosilca, G. E. Fagg, E. Gabriel et al., Performance Analysis of MPI Collective Operations, 19th IEEE International Parallel and Distributed Processing Symposium, 2005.
DOI : 10.1109/IPDPS.2005.335

S. Plimpton and K. Devine, MapReduce-MPI Library

R. Rabenseifner, Optimization of Collective Reduction Operations, Computational Science -ICCS 2004, pp.1-9, 2004.
DOI : 10.1007/978-3-540-24685-5_1

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.64.7050

R. Rabenseifner and J. Träff, More Efficient Reduction Algorithms for Non-Powerof-Two Number of Processors in Message-Passing Parallel Systems, Recent advances in parallel virtual machine and message passing interface, 2004.

H. Ritzdorf and J. Träff, Collective operations in NEC's high-performance MPI libraries, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium, 2006.
DOI : 10.1109/IPDPS.2006.1639334

P. Sanders, J. Speck, and J. Träff, Two-tree algorithms for full bandwidth broadcast, reduction and scan, Parallel Computing, vol.35, issue.12, pp.581-594, 2009.
DOI : 10.1016/j.parco.2009.09.001

R. Thakur and R. Rabenseifner, Optimization of Collective Communication Operations in MPICH, International Journal of High Performance Computing Applications, vol.19, issue.1, pp.49-66, 2005.
DOI : 10.1177/1094342005051521

A. Robert and . Van-de-geijn, On global combine operations, Journal of Parallel and Distributed Computing, vol.22, issue.2, pp.324-328, 1994.