Rank reordering for MPI communication optimization, Computers & Fluids, vol.80, 2012. ,
DOI : 10.1016/j.compfluid.2012.01.019
NAS Parallel Benchmark Results, 1994. ,
DOI : 10.1007/978-94-011-5412-3_14
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.107.17
Hwloc: a Generic Framework for Managing Hardware Anities in HPC Applications, Proceedings of the 18th Euromicro International Conference on Parallel, Distributed and Network-Based Processing, 2010. ,
Mapping communication layouts to network hardware characteristics on massive-scale blue gene systems, Computer Science - Research and Development, vol.49, issue.2???3, pp.3-4247256, 2011. ,
DOI : 10.1007/s00450-011-0168-y
MPIPP, Proceedings of the 20th annual international conference on Supercomputing , ICS '06, p.353360, 2006. ,
DOI : 10.1145/1183401.1183451
Implementation and Evaluation of Shared- Memory Communication and Synchronization Operations in MPICH2 using the Nemesis Communication Subsystem, Parallel Computing, vol.33, issue.9, p.634644, 2006. ,
A Prole Based Approach for Topology Aware MPI Rank Placement, 2007. ,
Method and System for Optimizing Communication in MPI Programs for an Execution Environment, 2008. ,
Static mapping by dual recursive bipartitioning of process architecture graphs, Proceedings of IEEE Scalable High Performance Computing Conference, p.486493, 1994. ,
DOI : 10.1109/SHPCC.1994.296682
Open MPI: Goals, concept, and design of a next generation MPI implementation, Proceedings of the 11th European PVM/MPI Users' Group Meeting, p.97104, 2004. ,
Rank Reordering Strategy for MPI Topology Creation Functions ,
Recent Advances in Parallel Virtual Machine and Message Passing Interface, Lecture Notes in Computer Science, vol.1497, pp.188195-188205, 1998. ,
The Chaco User's Guide: Version 2.0, 1994. ,
The Scalable Process Topology Interface of MPI 2.2. Concurrency and Computation: Practice and Experience, p.293310, 2011. ,
Generic Topology Mapping Strategies for Large-Scale Parallel Architectures, ICS, p.7584, 2011. ,
Locality-Aware Parallel Process Mapping for Multi-core HPC Systems, 2011 IEEE International Conference on Cluster Computing, p.527531, 2011. ,
DOI : 10.1109/CLUSTER.2011.59
Automatically optimized core mapping to subdomains of domain decomposition method on multicore parallel environments, Computers & Fluids, vol.80 ,
DOI : 10.1016/j.compfluid.2012.04.024
Implementing the MPI Process Topology Mechanism, Supercomputing`02 Supercomputing`02: Proceedings of the 2002 ACM/IEEE conference on Supercomputing, p.1 ,
Simulating Radiating and Magnetized Flows in Multiple Dimensions with ZEUS-MP, The Astrophysical Journal Supplement, vol.165, issue.1, p.188228, 2006. ,
Near-Optimal Placement of MPI Processes on Hierarchical NUMA Architectures, Euro-Par 2010 -Parallel Processing, 16th International Euro-Par Conference, 2010. ,
DOI : 10.1007/978-3-642-15291-7_20
URL : https://hal.archives-ouvertes.fr/inria-00544346
CHARM++: A Portable Concurrent Object Oriented System Based on C++, Proceedings of Object-Oriented Programming, Systems, Languages and Applications (OOPSLA) 93, p.91108, 1993. ,
METIS -Unstructured Graph Partitioning and Sparse Matrix Ordering System, Version 2.0, 1995. ,
Aufgabe 300, Jahresber. Deutsch. Math. -Verein, vol.58, 1955. ,
Approximation Algorithms for the Weighted Independent Set Problem, LNCS, number 3787, p.341350 ,
DOI : 10.1007/11604686_30
Towards an Ecient Process Placement Policy for MPI Applications in Multicore Environments, EuroPVM/MPI, p.104115, 2009. ,
Process Distance-Aware Adaptive MPI Collective Communications, 2011 IEEE International Conference on Cluster Computing, p.196204, 2011. ,
DOI : 10.1109/CLUSTER.2011.30
Improving MPI Applications Performance on Multicore Clusters with Rank Reordering, EuroMPI, p.3949, 2011. ,
DOI : 10.1007/978-3-642-24449-0_7
URL : https://hal.archives-ouvertes.fr/hal-00643151
An approach for matching communication patterns in parallel applications, 2009 IEEE International Symposium on Parallel & Distributed Processing, 2009. ,
DOI : 10.1109/IPDPS.2009.5161035
The scientic case for high performance computing in Europe ,
Multi-core and Network Aware MPI Topology Functions, EuroMPI, p.5060, 2011. ,
Multicore Aware Process Mapping and its Impact on Communication Overhead of Parallel Applications, Proceedings of the IEEE Symp. on Comp. and Comm, p.811817, 2009. ,
Performance Eects of Node Mappings on the IBM BlueGene/L Machine, Euro-Par, p.10051013, 2005. ,
Design of a Scalable Inniband Topology Service to Enable Network-Topology-Aware Placement of Processes, Proceedings of the 2012 ACM/IEEE conference on Supercomputing (CDROM), p.12, 2012. ,
UPC Language Specications, v1.2, 2005. ,
Blue Gene system software---Topology mapping for Blue Gene/L supercomputer, Proceedings of the 2006 ACM/IEEE conference on Supercomputing , SC '06, p.116, 2006. ,
DOI : 10.1145/1188455.1188576
Hierarchical Collectives in MPICH2, Proceedings of the 16th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface, pp.325-326, 2009. ,
DOI : 10.1007/978-3-642-03770-2_41
Process Mapping for MPI Collective Communications, Euro-Par, p.8192, 2009. ,
DOI : 10.1109/71.642949