Structured Implicit Parallelism for scientific Simulations Sommaire 4.1 SIPSim pour les maillages réguliers à deux dimensions, p.74 ,
89 4.3.2 Résolution numérique et programmation, p.94 ,
C++ Template Metaprogramming : Concepts, Tools, and Techniques from Boost and Beyond (C++ in Depth Series), pp.49-158, 2004. ,
Measuring Application Development Productivity, IBM Application Development Symp, pp.83-92, 1979. ,
PGAS (Partitioned Global Address Space) Languages, Encyclopedia of Parallel Computing, pp.1539-1545, 2011. ,
Multi-level direct K-way hypergraph partitioning with multiple constraints and fixed vertices, Journal of Parallel and Distributed Computing, vol.68, issue.5, pp.609-625, 2008. ,
DOI : 10.1016/j.jpdc.2007.09.006
URL : https://hal.archives-ouvertes.fr/hal-00803479
Multi-level direct K-way hypergraph partitioning with multiple constraints and fixed vertices, Journal of Parallel and Distributed Computing, vol.68, issue.5, pp.609-625, 2008. ,
DOI : 10.1016/j.jpdc.2007.09.006
URL : https://hal.archives-ouvertes.fr/hal-00803479
Efficient Management of Parallelism in Object-Oriented Numerical Software Libraries, Modern Software Tools in Scientific Computing, pp.163-202, 1997. ,
DOI : 10.1007/978-1-4612-1986-6_8
A Partitioning Strategy for Nonuniform Problems on Multiprocessors, IEEE Transactions on Computers, vol.36, issue.5, pp.570-580, 1987. ,
DOI : 10.1109/TC.1987.1676942
Du support générique d'opérateurs de composition dans les modèles de composants logiciels, application au calcul scientifique, pp.18-156, 2010. ,
A low level component model easing performance portability of HPC applications, Computing, vol.4, issue.5, pp.18-156, 2013. ,
DOI : 10.1007/s00607-013-0368-3
URL : https://hal.archives-ouvertes.fr/hal-00911231
Implementing remote procedure calls, ACM Transactions on Computer Systems, vol.2, issue.1, pp.39-59, 1984. ,
DOI : 10.1145/2080.357392
Parallel Scientific Computation : A Structured Approach Using BSP and MPI. Oxford scholarship online, OUP Oxford, pp.18-34, 2004. ,
DOI : 10.1093/acprof:oso/9780198529392.001.0001
Two-Dimensional Approaches to Sparse Matrix Partitioning, Combinatorial Scientific Computing, Computational Science Series, pp.321-349 ,
DOI : 10.1201/b11644-13
ScaLA- PACK User's Guide, Society for Industrial and Applied Mathematics, p.53, 1997. ,
Nonlinear stability of finite volume methods for hyperbolic conservation laws, and well-balanced schemes for sources, Birkhäuser Basel, vol.2, pp.91-154, 2004. ,
Extending the SkelCL Skeleton Library for Stencil Computations on Multi-GPU Systems, pp.15-2014, 2014. ,
Using the Intel MPI benchmarks (IMB) to evaluate MPI implementations on an Infiniband Nehalem Linux cluster, Proceedings of the 2010 Spring Simulation Multiconference on, SpringSim '10, pp.1-240, 2010. ,
DOI : 10.1145/1878537.1878787
A Parallel Framework for Unstructured Grid Solvers, 1994. ,
DOI : 10.1007/978-3-0348-8534-8_10
The STAPL pView, Proceedings of the 23rd International Conference on Languages and Compilers for Parallel Computing, LCPC'10, pp.261-275, 2011. ,
DOI : 10.1007/978-3-642-13374-9_2
STAPL, Proceedings of the 3rd Annual Haifa Experimental Systems Conference on, SYSTOR '10, pp.1-14, 2010. ,
DOI : 10.1145/1815695.1815713
Decomposing irregularly sparse matrices for parallel matrix-vector multiplication, pp.75-86, 1996. ,
Hypergraph-based Dynamic Load Balancing for Adaptive Scientific Computations, 2007 IEEE International Parallel and Distributed Processing Symposium, 2007. ,
DOI : 10.1109/IPDPS.2007.370258
Hypergraph-partitioning-based decomposition for parallel sparse-matrix vector multiplication, IEEE Transactions on Parallel and Distributed Systems, vol.10, issue.7, pp.673-693 ,
DOI : 10.1109/71.780863
Hypergraph-partitioning-based decomposition for parallel sparse-matrix vector multiplication, IEEE Transactions on Parallel and Distributed Systems, vol.10, issue.7, pp.673-693, 1999. ,
DOI : 10.1109/71.780863
A Fine-Grain Hypergraph Model for 2D Decomposition of Sparse Matrices Parallel Programmability and the Chapel Language, IPDPS, page 118, pp.291-312, 2001. ,
ZPL: a machine independent programming language for parallel computers, IEEE Transactions on Software Engineering, vol.26, issue.3, pp.197-211, 2000. ,
DOI : 10.1109/32.842947
CGMgraph/CGMlib: Implementing and Testing CGM Graph Algorithms on PC Clusters, In International Journal of High Performance Computing Applications, pp.2005-2054, 2003. ,
DOI : 10.1007/978-3-540-39924-7_20
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.5.9084
Using OpenMP : Portable Shared Memory Parallel Programming (Scientific and Engineering Computation), pp.15-46, 2007. ,
X10, ACM SIGPLAN Notices, vol.40, issue.10, pp.519-538, 2005. ,
DOI : 10.1145/1103845.1094852
URL : https://hal.archives-ouvertes.fr/in2p3-00166974
PATUS: A Code Generation and Autotuning Framework for Parallel Iterative Stencil Computations on Modern Microarchitectures, 2011 IEEE International Parallel & Distributed Processing Symposium, pp.676-687, 2011. ,
DOI : 10.1109/IPDPS.2011.70
The Münster Skeleton Library Muesli : A comprehensive overview, European Research Center for Information Systems (ERCIS), 2009. ,
Graph Twiddling in a MapReduce World, Computing in Science & Engineering, vol.11, issue.4, pp.29-41, 2009. ,
DOI : 10.1109/MCSE.2009.120
Bringing skeletons out of the closet: a pragmatic manifesto for skeletal parallel programming, Parallel Computing, vol.30, issue.3, pp.389-406, 2004. ,
DOI : 10.1016/j.parco.2003.12.002
Algorithmic Skeletons, 1988. ,
DOI : 10.1007/978-1-4471-0841-2_13
FullSWOF Paral: Comparison of two parallelization strategies (MPI and SKELGIS) on a software designed for hydrology applications, ESAIM: Proceedings, vol.43, pp.59-79, 2013. ,
DOI : 10.1051/proc/201343004
URL : https://hal.archives-ouvertes.fr/hal-00845812
Blood Flow Arterial Network Simulation with the Implicit Parallelism Library SkelGIS, ICCS, 2014. ,
DOI : 10.1016/j.procs.2014.05.010
URL : https://hal.archives-ouvertes.fr/hal-01446450
Parallelization of Shallow-Water Equations with the Algorithmic Skeleton Library SkelGIS, ICCS, volume 18 of Procedia Computer Science, pp.591-600, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00832660
Implementation and Performance Analysis of SkelGIS for Network Mesh-Based Simulations, Euro-Par, 2014. ,
DOI : 10.1007/978-3-319-09873-9_37
URL : https://hal.archives-ouvertes.fr/hal-01094340
Multigrid aircraft computations using the OPlus parallel library*, 1995. ,
DOI : 10.1016/B978-044482322-9/50096-7
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.48.9819
ComPASS : a tool for distributed parallel finite volume discretizations on general unstructured polyhedral meshes, ESAIM: Proceedings, vol.43, pp.147-163, 2013. ,
DOI : 10.1051/proc/201343010
URL : https://hal.archives-ouvertes.fr/hal-00781235
MapReduce, Communications of the ACM, vol.51, issue.1, pp.107-113, 2008. ,
DOI : 10.1145/1327452.1327492
Simulation du ruissellement d'eau de pluie sur des surfaces agricoles/ rain water overland flow on agricultural fields simulation, pp.90-91, 2010. ,
DOI : 10.1090/psapm/067.2/2605249
URL : https://hal.archives-ouvertes.fr/tel-00531377
Parallel hypergraph partitioning for scientific computing, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium, 2006. ,
DOI : 10.1109/IPDPS.2006.1639359
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.103.5418
Liszt, Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis on, SC '11, pp.1-9, 2011. ,
DOI : 10.1145/2063384.2063396
Distributed Compressed Sparse Row SkePU : a multi-backend skeleton programming library for multi-GPU systems, Proceedings of the fourth international workshop on High-level parallel programming and applications, pp.5-14, 2010. ,
Abusing a hypergraph partitioner for unweighted graph partitioning, Graph Partitioning and Graph Clustering, pp.19-35, 2013. ,
DOI : 10.1090/conm/588/11707
Some Computer Organizations and Their Effectiveness, IEEE Transactions on Computers, vol.21, issue.9, pp.948-960, 1972. ,
DOI : 10.1109/TC.1972.5009071
A Fast Fourier Transform Compiler, Proceedings of the ACM SIG- PLAN 1999 Conference on Programming Language Design and Implementation, PLDI '99, pp.169-180, 1999. ,
Open MPI: Goals, Concept, and Design of a Next Generation MPI Implementation, Proceedings, 11th European PVM/MPI Users' Group Meeting, pp.97-104, 2004. ,
DOI : 10.1007/978-3-540-30218-6_19
Some simplified NP-complete graph problems, Theoretical Computer Science, vol.1, issue.3, pp.237-267, 1976. ,
DOI : 10.1016/0304-3975(76)90059-1
URL : http://doi.org/10.1016/0304-3975(76)90059-1
A New Algorithm for Multi-objective Graph Partitioning, Proceedings of Europar, pp.322-331, 1999. ,
Performance analysis of the OP2 framework on many-core architectures, ACM SIGMETRICS Performance Evaluation Review, vol.38, issue.4, pp.9-15, 2011. ,
DOI : 10.1145/1964218.1964221
The MPI 2.2 Standard and the Emerging MPI 3 Standard, Proceedings of the 16th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface, pp.2-2, 2009. ,
DOI : 10.1007/978-3-642-03770-2_2
The parallel bgl : A generic library for distributed graph computations, Parallel Object-Oriented Scientific Computing (POOSC, pp.49-113, 2005. ,
MPICH2: A New Start for MPI Implementations, Proceedings of the 9th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface, p.7, 2002. ,
DOI : 10.1007/3-540-45825-5_5
PSTL -A C++ Persistent Standard Template Library, Proceedings of the 6th USENIX Conference on Object-Oriented Technologies and Systems, pp.147-158, 2001. ,
Elements of Software Science (Operating and programming systems series) Elsevier Science, pp.59-95, 1977. ,
Graph partitioning models for parallel computing, Parallel Computing, vol.26, issue.12, pp.1519-1534, 2000. ,
DOI : 10.1016/S0167-8191(00)00048-X
The Chaco User's Guide : Version 2.0, p.36, 1994. ,
DOI : 10.2172/10106339
URL : http://www.osti.gov/scitech/servlets/purl/10106339
An Overview of Trilinos, 2003. ,
BSPlib: The BSP programming library, Parallel Computing, vol.24, issue.14, pp.1947-1980, 1998. ,
DOI : 10.1016/S0167-8191(98)00093-3
Thrust : A Parallel Template Library, 2010. ,
OSL: Optimized Bulk Synchronous Parallel Skeletons on Distributed Arrays, Advanced Parallel Processing Technologies, pp.436-451, 2009. ,
DOI : 10.1145/79173.79181
URL : https://hal.archives-ouvertes.fr/inria-00452523
Parallel Programming and Performance Predictability with Orléans Skeleton Library, International Conference on High Performance Computing and Simulation, pp.257-263, 2011. ,
Introduction aux méthodes numériques, pp.19-89, 2000. ,
An auto-tuning framework for parallel multicore stencil computations, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS), pp.1-12, 2010. ,
DOI : 10.1109/IPDPS.2010.5470421
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.188.9104
PEGASUS: A Peta-Scale Graph Mining System Implementation and Observations, 2009 Ninth IEEE International Conference on Data Mining, pp.229-238, 2009. ,
DOI : 10.1109/ICDM.2009.14
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.156.764
A Parallel Skeleton Library for Multi-core Clusters, 2009 International Conference on Parallel Processing, pp.84-91, 2009. ,
DOI : 10.1109/ICPP.2009.18
A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs, SIAM Journal on Scientific Computing, vol.20, issue.1, pp.359-392, 1998. ,
DOI : 10.1137/S1064827595287997
Parallel multilevel k-way partitioning scheme for irregular graphs, Proceedings of the 1996 ACM/IEEE conference on Supercomputing (CDROM) , Supercomputing '96, pp.96-129, 1998. ,
DOI : 10.1145/369028.369103
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.133.6013
Treadmarks : Distributed shared memory on standard workstations and operating systems, Proceedings of the USENIX Winter 1994 Technical Conference on USENIX Winter 1994 Technical Conference, WTEC'94, pp.10-10, 1994. ,
An empirical validation of software cost estimation models, Communications of the ACM, vol.30, issue.5, pp.416-429, 1987. ,
DOI : 10.1145/22899.22906
SBV-Cut: Vertex-cut based graph partitioning using structural balance vertices, Data & Knowledge Engineering, vol.72, pp.285-303 ,
DOI : 10.1016/j.datak.2011.11.004
Implementing Spectral Methods for Partial Differential Equations : Algorithms for Scientists and Engineers, Mathematics and Statistics, 2009. ,
DOI : 10.1007/978-90-481-2261-5
The Hungarian Method for the Assignment Problem, 50 Years of Integer Programming, pp.29-47, 2010. ,
Squelettes algorithmiques pour la programmation et l'exécution efficaces de codes parallèles. These, pp.50-51, 2013. ,
Hierarchical Partitioning Techniques for Structured Adaptive Mesh Refinement (SAMR) Applications, ICPP Workshops, pp.336-343, 2002. ,
Automatic tiling of iterative stencil loops, ACM Transactions on Programming Languages and Systems, vol.26, issue.6, 2004. ,
DOI : 10.1145/1034774.1034777
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.9.5190
Calcul scientifique parallèle : Cours, exemples avec openMP et MPI , exercices corrigés, Mathématiques appliquées pour le Master/S- MAI. Dunod, pp.2013-2058 ,
Pregel : A System for Large-scale Graph Processing, Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data, SIGMOD '10, pp.135-146, 2010. ,
Physis, Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis on, SC '11, pp.1-1112, 2011. ,
DOI : 10.1145/2063384.2063398
Parallel skeletons for manipulating general trees, Parallel Computing, vol.32, issue.7-8, pp.590-603, 2006. ,
DOI : 10.1016/j.parco.2006.06.002
A Complexity Measure, Proceedings of the 2Nd International Conference on Software Engineering, ICSE '76, p.407, 1976. ,
DOI : 10.1109/TSE.1976.233837
Scalability, portability and predictability: The BSP approach to parallel programming, Future Generation Computer Systems, vol.12, issue.4, pp.265-272, 1996. ,
DOI : 10.1016/S0167-739X(96)00030-1
An Introduction to High Performance Fortran, Scientific Programming, vol.4, issue.2, pp.87-113, 1995. ,
DOI : 10.1155/1995/612973
URL : http://doi.org/10.1155/1995/612973
OP2: An active library framework for solving unstructured mesh-based applications on multi-core and many-core architectures, 2012 Innovative Parallel Computing (InPar), pp.1-12, 2012. ,
DOI : 10.1109/InPar.2012.6339594
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.259.3000
STL Tutorial and Reference Guide, Second Edition : C++ Programming with the Standard Template Library, 2001. ,
Pthreads programming -a POSIX standard for better multiprocessing, 1996. ,
Rectilinear Partitioning of Irregular Data Parallel Computations, Journal of Parallel and Distributed Computing, vol.23, issue.2, pp.119-134, 1994. ,
DOI : 10.1006/jpdc.1994.1126
Co-array Fortran for parallel programming, ACM SIGPLAN Fortran Forum, vol.17, issue.2, pp.1-31, 1998. ,
DOI : 10.1145/289918.289920
Programming in Scala : A Comprehensive Step-by-step Guide, Artima Incorporation, 2008. ,
Scotch: A software package for static mapping by dual recursive bipartitioning of process and architecture graphs, HPCN Europe, pp.493-498, 1996. ,
DOI : 10.1007/3-540-61142-8_588
A Medium-Grain Method for Fast 2D Bipartitioning of Sparse Matrices, 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014. ,
DOI : 10.1109/IPDPS.2014.62
An Introduction to Partial Differential Equations in An introduction to partial differential equations, pp.25-28, 2005. ,
The TAO of parallelism in algorithms, Proceedings of the 32nd ACM SIGPLAN conference on Programming language design and implementation, PLDI 2011, pp.12-25, 2011. ,
Intel Threading Building Blocks, 2007. ,
Guide to Partitioning Unstructured Meshes for Parallel Computing, pp.42-43 ,
CUDA by Example : An Introduction to General- Purpose GPU Programming, 2010. ,
Parallel Multilevel Algorithms for Multi-constraint Graph Partitioning, Proceedings of Supercomputing, p.43, 1998. ,
DOI : 10.1007/3-540-44520-X_39
Development of Mixed Mode MPI / OpenMP Applications, Scientific Programming, vol.9, issue.2-3, pp.83-98, 2001. ,
DOI : 10.1155/2001/450503
Introduction to Computational PDEs, pp.25-28 ,
SkelCL - A Portable Skeleton Library for High-Level GPU Programming, 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum, pp.1176-1182 ,
DOI : 10.1109/IPDPS.2011.269
OpenCL: A Parallel Programming Standard for Heterogeneous Computing Systems, Computing in Science & Engineering, vol.12, issue.3, pp.66-73, 2010. ,
DOI : 10.1109/MCSE.2010.69
The STAPL pArray, Proceedings of the 2007 workshop on MEmory performance DEaling with Applications, systems and architecture, MEDEA '07, pp.73-80, 2007. ,
DOI : 10.1145/1327171.1327180
The STAPL Parallel Container Framework, Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming, pp.235-246, 2011. ,
The STAPL pList, Proceedings of the 22Nd International Conference on Languages and Compilers for Parallel Computing, LCPC'09, pp.16-30, 2010. ,
DOI : 10.1007/978-3-642-13374-9_2
The pochoir stencil compiler, Proceedings of the 23rd ACM symposium on Parallelism in algorithms and architectures, SPAA '11, pp.117-128, 2011. ,
DOI : 10.1145/1989493.1989508
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.208.6030
Handbook of Grid Generation, 1998. ,
A bridging model for parallel computation, Communications of the ACM, vol.33, issue.8, pp.103-111, 1990. ,
DOI : 10.1145/79173.79181
A bridging model for parallel computation, Communications of the ACM, vol.33, issue.8, pp.103-111, 1990. ,
DOI : 10.1145/79173.79181
A Two-Dimensional Data Distribution Method for Parallel Sparse Matrix-Vector Multiplication, SIAM Review, vol.47, issue.1, pp.67-95, 2005. ,
DOI : 10.1137/S0036144502409019
Dynamic load-balancing for PDE solvers on adaptive unstructured meshes. Concurrency -Practice and Experience, pp.17-28, 1995. ,
DOI : 10.1002/cpe.4330070103
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.10.1115
JOSTLE - Multilevel Graph Partitioning Software: An Overview, Mesh Partitioning Techniques and Domain Decomposition Techniques, pp.27-58, 2007. ,
DOI : 10.4203/csets.17.2
Multilevel Mesh Partitioning for Optimizing Domain Shape, International Journal of High Performance Computing Applications, vol.13, issue.4, 1999. ,
DOI : 10.1177/109434209901300404
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.19.7235
Parallel Dynamic Graph Partitioning for Adaptive Unstructured Meshes, Journal of Parallel and Distributed Computing, vol.47, issue.2, pp.102-108, 1997. ,
DOI : 10.1006/jpdc.1997.1407
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.38.8817
Multiphase mesh partitioning, Applied Mathematical Modelling, vol.25, issue.2, pp.123-140, 2000. ,
DOI : 10.1016/S0307-904X(00)00041-X
URL : http://doi.org/10.1016/s0307-904x(00)00041-x
Verification and comparison of four numerical schemes for a 1D viscoelastic blood flow model, Computer Methods in Biomechanics and Biomedical Engineering, vol.15, issue.1, pp.123-125 ,
DOI : 10.1016/0021-9290(86)90118-1
URL : https://hal.archives-ouvertes.fr/hal-00807040
Hadoop : The Definitive Guide, 2009. ,
OpenACC ??? First Experiences with Real-World Applications, Proceedings of the 18th International Conference on Parallel Processing, Euro-Par'12, pp.859-870, 2012. ,
DOI : 10.1007/978-3-642-32820-6_85
Algorithms + Data Structures = Programs, 1978. ,
An object-oriented bulk synchronous parallel library for multicore programming, Concurrency and Computation: Practice and Experience, vol.16, issue.1-2, pp.533-553 ,
DOI : 10.1002/cpe.1843
Controlling Unstructured Mesh Partitions for Massively Parallel Simulations, SIAM Journal on Scientific Computing, vol.32, issue.6, pp.3201-3227, 2010. ,
DOI : 10.1137/090777323