B. Brandfass, T. Alrutz, and T. Gerhold, Rank reordering for MPI communication optimization, Computers & Fluids, vol.80, 2012.
DOI : 10.1016/j.compfluid.2012.01.019

D. H. Bailey, E. Barszcz, L. Dagum, and H. D. Simon, NAS Parallel Benchmark Results, 1994.
DOI : 10.1007/978-94-011-5412-3_14

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.107.17

F. Broquedis, J. Clet-Ortega, S. Moreaud, N. Furmento, B. Goglin et al., hwloc: a Generic Framework for Managing Hardware Affinities in HPC Applications, Proceedings of the 18th Euromicro International Conference on Parallel, Distributed and Network-Based Processing, 2010.

P. Balaji, R. Gupta, A. Vishnu, and P. H. Beckman, Mapping communication layouts to network hardware characteristics on massive-scale Blue Gene systems, Computer Science - Research and Development, vol.26, issue.3-4, pp.247-256, 2011.
DOI : 10.1007/s00450-011-0168-y

H. Chen, W. Chen, J. Huang, B. Robert, and H. Kuhn, MPIPP, Proceedings of the 20th annual international conference on Supercomputing, ICS '06, pp.353-360, 2006.
DOI : 10.1145/1183401.1183451

D. Buntinas, G. Mercier, and W. Gropp, Implementation and Evaluation of Shared-Memory Communication and Synchronization Operations in MPICH2 using the Nemesis Communication Subsystem, Parallel Computing, vol.33, issue.9, pp.634-644, 2006.

D. Solt, A Profile Based Approach for Topology Aware MPI Rank Placement, 2007.

E. Duesterwald, R. W. Wisniewski, P. F. Sweeney, G. Cascaval, and S. E. Smith, Method and System for Optimizing Communication in MPI Programs for an Execution Environment, 2008.

F. Pellegrini, Static mapping by dual recursive bipartitioning of process and architecture graphs, Proceedings of IEEE Scalable High Performance Computing Conference, pp.486-493, 1994.
DOI : 10.1109/SHPCC.1994.296682

E. Gabriel, G. E. Fagg, G. Bosilca, T. Angskun, J. J. Dongarra et al., Open MPI: Goals, concept, and design of a next generation MPI implementation, Proceedings of the 11th European PVM/MPI Users' Group Meeting, pp.97-104, 2004.

T. Hatazaki, Rank Reordering Strategy for MPI Topology Creation Functions, in V. Alexandrov and J. Dongarra (eds.), Recent Advances in Parallel Virtual Machine and Message Passing Interface, Lecture Notes in Computer Science, vol.1497, pp.188-195, 1998.

B. Hendrickson and R. Leland, The Chaco User's Guide: Version 2.0, 1994.

T. Hoefler, R. Rabenseifner, H. Ritzdorf, B. R. de Supinski, R. Thakur, and J. L. Träff, The Scalable Process Topology Interface of MPI 2.2, Concurrency and Computation: Practice and Experience, pp.293-310, 2011.

T. Hoefler and M. Snir, Generic Topology Mapping Strategies for Large-Scale Parallel Architectures, ICS, pp.75-84, 2011.

J. Hursey, J. Squyres, and T. Dontje, Locality-Aware Parallel Process Mapping for Multi-core HPC Systems, 2011 IEEE International Conference on Cluster Computing, pp.527-531, 2011.
DOI : 10.1109/CLUSTER.2011.59

S. Ito, K. Goto, and K. Ono, Automatically optimized core mapping to subdomains of domain decomposition method on multicore parallel environments, Computers & Fluids, vol.80
DOI : 10.1016/j.compfluid.2012.04.024

J. L. Träff, Implementing the MPI Process Topology Mechanism, Supercomputing '02: Proceedings of the 2002 ACM/IEEE conference on Supercomputing, p.1, 2002.

J. C. Hayes, M. L. Norman, R. A. Fiedler, J. O. Bordner, P. S. Li, S. E. Clark, A. ud-Doula, and M.-M. Mac Low, Simulating Radiating and Magnetized Flows in Multiple Dimensions with ZEUS-MP, The Astrophysical Journal Supplement, vol.165, issue.1, pp.188-228, 2006.

E. Jeannot and G. Mercier, Near-Optimal Placement of MPI Processes on Hierarchical NUMA Architectures, Euro-Par 2010 - Parallel Processing, 16th International Euro-Par Conference, 2010.
DOI : 10.1007/978-3-642-15291-7_20

URL : https://hal.archives-ouvertes.fr/inria-00544346

L. V. Kale and S. Krishnan, CHARM++: A Portable Concurrent Object Oriented System Based on C++, Proceedings of Object-Oriented Programming, Systems, Languages and Applications (OOPSLA) '93, pp.91-108, 1993.

G. Karypis and V. Kumar, METIS - Unstructured Graph Partitioning and Sparse Matrix Ordering System, Version 2.0, 1995.

M. Kneser, Aufgabe 360, Jahresber. Deutsch. Math.-Verein., vol.58, 1955.

A. Kako, T. Ono, T. Hirata, and M. M. Halldórsson, Approximation Algorithms for the Weighted Independent Set Problem, LNCS, number 3787, pp.341-350
DOI : 10.1007/11604686_30

G. Mercier and J. Clet-Ortega, Towards an Efficient Process Placement Policy for MPI Applications in Multicore Environments, EuroPVM/MPI, pp.104-115, 2009.

T. Ma, T. Hérault, G. Bosilca, and J. Dongarra, Process Distance-Aware Adaptive MPI Collective Communications, 2011 IEEE International Conference on Cluster Computing, pp.196-204, 2011.
DOI : 10.1109/CLUSTER.2011.30

G. Mercier and E. Jeannot, Improving MPI Applications Performance on Multicore Clusters with Rank Reordering, EuroMPI, pp.39-49, 2011.
DOI : 10.1007/978-3-642-24449-0_7

URL : https://hal.archives-ouvertes.fr/hal-00643151

C. Ma, Y. M. Teo, V. March, N. Xiong, I. R. Pop et al., An approach for matching communication patterns in parallel applications, 2009 IEEE International Symposium on Parallel & Distributed Processing, 2009.
DOI : 10.1109/IPDPS.2009.5161035

PRACE, The Scientific Case for High Performance Computing in Europe

M. J. Rashti, J. Green, P. Balaji, A. Afsahi, and W. Gropp, Multi-core and Network Aware MPI Topology Functions, EuroMPI, pp.50-60, 2011.

E. Rodrigues, F. Madruga, P. Navaux, and J. Panetta, Multicore Aware Process Mapping and its Impact on Communication Overhead of Parallel Applications, Proceedings of the IEEE Symp. on Comp. and Comm., pp.811-817, 2009.

B. Smith and B. Bode, Performance Effects of Node Mappings on the IBM BlueGene/L Machine, Euro-Par, pp.1005-1013, 2005.

H. Subramoni, S. Potluri, K. Kandalla, B. Barth, J. Vienne et al., Design of a Scalable InfiniBand Topology Service to Enable Network-Topology-Aware Placement of Processes, Proceedings of the 2012 ACM/IEEE conference on Supercomputing (CDROM), p.12, 2012.

UPC Consortium, UPC Language Specifications, v1.2, 2005.

H. Yu, I.-H. Chung, and J. E. Moreira, Blue Gene system software - Topology mapping for Blue Gene/L supercomputer, Proceedings of the 2006 ACM/IEEE conference on Supercomputing, SC '06, p.116, 2006.
DOI : 10.1145/1188455.1188576

H. Zhu, D. Goodell, W. Gropp, and R. Thakur, Hierarchical Collectives in MPICH2, Proceedings of the 16th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface, pp.325-326, 2009.
DOI : 10.1007/978-3-642-03770-2_41

J. Zhang, J. Zhai, W. Chen, and W. Zheng, Process Mapping for MPI Collective Communications, Euro-Par, pp.81-92, 2009.
DOI : 10.1109/71.642949