B. Brandfass, T. Alrutz, and T. Gerhold, Rank reordering for MPI communication optimization, Computers & Fluids, vol.80, 2012.
DOI : 10.1016/j.compfluid.2012.01.019

F. Broquedis, J. Clet-ortega, S. Moreaud, N. Furmento, B. Goglin et al., hwloc: A Generic Framework for Managing Hardware Affinities in HPC Applications, 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing, 2010.
DOI : 10.1109/PDP.2010.67

URL : https://hal.archives-ouvertes.fr/inria-00429889

D. Culler, R. Karp, D. Patterson, A. Sahay, K. E. Schauser et al., Logp: Towards a realistic model of parallel computation, SIGPLAN Not, vol.28, issue.7, p.112, 1993.
DOI : 10.1145/155332.155333

URL : http://www.crhc.uiuc.edu/ece412/papers/logp.pdf

B. Goglin, J. Hursey, and J. M. Squyres, Netloc: Towards a Comprehensive View of the HPC System Topology, 2014 43rd International Conference on Parallel Processing Workshops, p.216225, 2014.
DOI : 10.1109/ICPPW.2014.38

URL : https://hal.archives-ouvertes.fr/hal-01010599

T. Hatazaki, Rank reordering strategy for MPI topology creation functions, Recent Advances in Parallel Virtual Machine and Message Passing Interface, p.188195
DOI : 10.1007/BFb0056575

R. W. Hockney, The communication challenge for MPP: Intel Paragon and Meiko CS-2, Parallel Computing, vol.20, issue.3
DOI : 10.1016/S0167-8191(06)80021-9

J. Hursey, J. M. Squyres, and T. Dontje, Locality-Aware Parallel Process Mapping for Multi-core HPC Systems, 2011 IEEE International Conference on Cluster Computing, p.527531, 2011.
DOI : 10.1109/CLUSTER.2011.59

J. L. Trä, Implementing the MPI Process Topology Mechanism, Supercomputing`02Supercomputing`02: Proceedings of the 2002 ACM/IEEE conference on Supercomputing, 2002.

E. Jeannot, G. Mercier, and F. Tessier, Process Placement in Multicore Clusters: Algorithmic Issues and Practical Techniques, IEEE Trans. Parallel Distrib. Syst, vol.25, issue.4, p.9931002, 2014.
DOI : 10.1109/tpds.2013.104

URL : https://hal.archives-ouvertes.fr/hal-00803548

T. Jesper-larsson, Implementing the MPI process topology mechanism, Supercom- puting`02puting`02: Proceedings of the 2002 ACM/IEEE conference on Supercomputing, 2002.

T. Kielmann, H. E. Bal, and K. Verstoep, Fast Measurement of LogP Parameters for Message Passing Platforms, p.11761183, 2000.
DOI : 10.1007/3-540-45591-4_162

URL : http://www.cs.vu.nl/~kielmann/papers/rtspp00.ps.gz

G. Mercier and J. Clet-ortega, Towards an Ecient Process Placement Policy for MPI Applications in Multicore Environments, EuroPVM/MPI, p.104115, 2009.
DOI : 10.1007/978-3-642-03770-2_17

URL : https://hal.inria.fr/inria-00392581/document/

G. Mercier and E. Jeannot, Improving MPI Applications Performance on Multicore Clusters with Rank Reordering, EuroMPI, p.3949, 2011.
DOI : 10.1145/1183401.1183451

URL : https://hal.archives-ouvertes.fr/hal-00643151

. Plafrim, Plate-forme fédérative pour la recherche en informatique et mathématiques

J. Quintin, K. Hasanov, and A. Lastovetsky, Hierarchical Parallel Matrix Multiplication on Large-Scale Distributed Memory Platforms, 2013 42nd International Conference on Parallel Processing, pp.754762-754763, 2013.
DOI : 10.1109/ICPP.2013.89

URL : http://arxiv.org/pdf/1306.4161.pdf

M. J. Rashti, J. Green, P. Balaji, A. Afsahi, and W. Gropp, Multi-core and Network Aware MPI Topology Functions, EuroMPI 2011. Recent Advances in the Message Passing Interface -18th European MPI Users' Group Meeting, p.5060
DOI : 10.1109/PDP.2010.67

URL : http://post.queensu.ca/~afsahi/PPRL/papers/EuroMPI-2011.pdf

J. Reinders and J. Jeers, High Performance Parallelism Pearls, 2015.

R. A. Van-de-geijn and J. Watts, SUMMA: scalable universal matrix multiplication algorithm, Concurrency: Practice and Experience, vol.9, issue.4, p.255274, 1997.
DOI : 10.1002/(SICI)1096-9128(199704)9:4<255::AID-CPE250>3.0.CO;2-2

J. Zhang, J. Zhai, W. Chen, and W. Zheng, Process Mapping for MPI Collective Communications, Euro-Par, p.8192, 2009.
DOI : 10.1109/ICPP.2005.62

URL : http://hpc.cs.tsinghua.edu.cn/research/cluster/papers_cwg/europar_zhang.pdf

H. Zhu, D. Goodell, W. Gropp, and R. Thakur, Hierarchical Collectives in MPICH2, Proceedings of the 16th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface, p.325326, 2009.
DOI : 10.1109/JSSC.2007.910957

URL : http://www.mcs.anl.gov/uploads/cels/papers/P1622.pdf