Conduit: Simplified data exchange for HPC simulations

. Accessed, , pp.2018-2024

The OpenMP API specification for parallel programming, pp.2018-2024

TBB initialization, termination, and resource management details, juicy and gory

Threading building blocks

. Accessed, , pp.2018-2024

Top500 supercomputer sites

. Accessed, , pp.2018-2025

E. Agullo, O. Aumage, M. Faverge, N. Furmento, F. Pruvost et al., Achieving high performance on supercomputers with a sequential task-based programming model, p.31
URL : https://hal.archives-ouvertes.fr/hal-01618526

S. Ahern, A. Shoshani, K. Ma, A. Choudhary, T. Critchlow et al., Scientific discovery at the exascale: Report from the DOE ASCR 2011 workshop on exascale data management, analysis, and visualization, vol.2, 2011.

J. Ahrens, J. Patchett, A. Bauer, S. Jourdain, H. David, P. Fasel, F. Samsel et al., In situ MPAS-Ocean image-based visualization, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, Visualization & Data Analytics Showcase, p.33, 2014.

J. Allard, J. Lesage, and B. Raffin, Modularity for large virtual reality applications, Presence: Teleoperators and Virtual Environments, vol.19, issue.2, p.41, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00688474

M. P. Allen and D. J. Tildesley, Computer simulation of liquids, 1987.

I. Altintas, C. Berkley, E. Jaeger, M. Jones, B. Ludascher et al., Kepler: an extensible system for design and execution of scientific workflows, Scientific and Statistical Database Management, p.42, 2004.

D. Saman-amarasinghe, W. Campbell, A. Carlson, W. Chien, E. Dally et al., Robert Harrison, DARPA IPTO, p.31, 2009.

C. Augonnet, O. Aumage, N. Furmento, R. Namyst, and S. Thibault, StarPU-MPI: Task programming over clusters of machines enhanced with accelerators, European MPI Users' Group Meeting, pp.298-299. Springer, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00725477

C. Augonnet, S. Thibault, R. Namyst, and P. Wacrenier, StarPU: a unified platform for task scheduling on heterogeneous multicore architectures, Concurrency and Computation: Practice and Experience, vol.23, issue.2, pp.28-153, 2011.

U. Ayachit, The paraview guide, p.34, 2015.

U. Ayachit, B. Whitlock, M. Wolf, B. Loring, B. Geveci et al., The SENSEI generic in situ interface, In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization (ISAV), Workshop on, p.34, 2016.

A. C. Bauer, H. Abbasi, J. Ahrens, H. Childs, B. Geveci et al., In situ methods, infrastructures, and applications on high performance computing platforms, Computer Graphics Forum, vol.35, issue.3, p.33, 2016.

A. C. Bauer, B. Geveci, and W. Schroeder, The ParaView Catalyst user's guide v2.0. Kitware, p.34, 2015.

M. Bauer, S. Treichler, E. Slaughter, and A. Aiken, Legion: Expressing locality and independence with logical regions, Proceedings of the international conference on high performance computing, networking, storage and analysis, pp.31-153, 2012.

J. C. Bennett, H. Abbasi, P.-T. Bremer, R. Grout et al., Combining in-situ and in-transit processing to enable extreme-scale scientific analysis, Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, SC '12, vol.49, pp.1-49, 2012.

R. D. Blumofe, C. F. Joerg et al., Cilk: An efficient multithreaded runtime system, vol.30, pp.19-28, 1995.

R. D. Blumofe and C. E. Leiserson, Scheduling multithreaded computations by work stealing, Journal of the ACM (JACM), vol.46, issue.5, p.28, 1999.

E. Boman, K. Devine, L. A. Fisk, R. Heaphy, B. Hendrickson et al., Zoltan 3.0: parallel partitioning, load-balancing, and data management services; user's guide. Sandia National Laboratories, p.144, 2007.

D. R. Butenhof, Programming with POSIX threads, p.27, 1997.

T. E. Cheatham III and D. R. Roe, The impact of heterogeneous computing on workflows for biomolecular simulation and analysis, Computing in Science & Engineering, vol.17, issue.2, p.67, 2015.

H. Childs, E. Brugger, B. Whitlock, J. Meredith, S. Ahern et al., Visit: An end-user tool for visualizing and analyzing very large data, p.34, 2012.

Y. Cho, S. Oh, and B. Egger, Adaptive space-shared scheduling for shared-memory parallel programs, Job Scheduling Strategies for Parallel Processing, pp.158-177, 2015.

E. Cieren, L. Colombet, S. Pitoiset, and R. Namyst, Exastamp: A parallel framework for molecular dynamics on heterogeneous clusters, European Conference on Parallel Processing, pp.121-132

K. Coulomb, . Faverge, . Jazeix, . Lagrasse, P. Marcoueille et al., Visual trace explorer (vite), p.70, 2009.

D. E. Culler, J. Singh, and A. Gupta, Parallel computer architecture: a hardware/software approach. Gulf Professional Publishing, 1999.

L. Dagum and R. Menon, Openmp: an industry standard api for shared-memory programming, IEEE computational science and engineering, vol.5, issue.1, p.28, 1998.

J. Chassin de Kergommeaux, B. Stein, and P. Bernard, Pajé, an interactive visualization tool for tuning multi-threaded parallel applications, Parallel Computing, vol.26, issue.10, pp.1253-1274, 2000.

J. Dean and S. Ghemawat, MapReduce: Simplified data processing on large clusters, Commun. ACM, vol.51, issue.1, pp.107-113, 2008.

E. Deelman, T. Peterka, I. Altintas, D. Christopher, K. Carothers et al., The future of scientific workflows, The International Journal of High Performance Computing Applications, vol.32, issue.1, p.42, 2018.

E. Deelman, G. Singh, M. Su, J. Blythe, Y. Gil et al., Pegasus: A framework for mapping complex scientific workflows onto distributed systems, Scientific Programming, vol.13, issue.3, p.42, 2005.

E. Dirand, L. Colombet, and B. Raffin, TINS: A task-based dynamic helper core strategy for in situ analytics, Asian Conference on Supercomputing Frontiers, pp.159-178. Springer, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01730910

C. Docan, M. Parashar, and S. Klasky, Dart: a substrate for high speed asynchronous data io, Proceedings of the 17th international symposium on High performance distributed computing, p.39, 2008.

C. Docan, M. Parashar, and S. Klasky, DataSpaces: an Interaction and Coordination Framework for Coupled Simulation Workflows, Cluster Computing, vol.15, issue.2, p.39, 2012.

S. Donovan, G. Huizenga, J. Andrew, C. Hutton, . Ross et al., Lustre: Building a file system for 1000-node clusters, Proceedings of the Linux Symposium, p.32, 2003.

M. Dorier, G. Antoniu, F. Cappello, M. Snir, and L. Orf, Damaris: How to efficiently leverage multicore parallelism to achieve scalable, jitter-free I/O, 2012 IEEE International Conference on Cluster Computing, pp.155-163, 2012.

M. Dorier, R. Sisneros, T. Peterka, G. Antoniu, and D. Semeraro, Damaris/viz: A nonintrusive, adaptable and user-friendly in situ visualization framework, Large-Scale Data Analysis and Visualization (LDAV), 2013 IEEE Symposium on, pp.67-75, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00859603

M. Dorier, R. Sisneros, L. B. Gomez, T. Peterka, L. Orf et al., Adaptive performance-constrained in situ visualization of atmospheric simulations, IEEE International Conference on, pp.269-278, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01351919

M. Dreher and B. Raffin, A flexible framework for asynchronous in situ and in transit analytics for scientific simulations, Cluster, Cloud and Grid Computing (CCGrid), pp.277-286, 2014.

M. Dreher and T. Peterka, Bredala: Semantic data redistribution for in situ applications, Cluster Computing (CLUSTER), 2016 IEEE International Conference on, pp.279-288, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01358482

M. Dreher and T. Peterka, Decaf: Decoupled dataflows for in situ high-performance workflows, Argonne National Lab.(ANL), p.130, 2017.

M. Dreher, K. Sasikumar, S. Sankaranarayanan, and T. Peterka, Manala: a flexible flow control library for asynchronous task communication, 2017 IEEE International Conference on Cluster Computing (CLUSTER), p.43

O. Durand and L. Soulard, Power law and exponential ejecta size distributions from the dynamic fragmentation of shock-loaded Cu and Sn metals under melt conditions, Journal of Applied Physics, vol.114, issue.19, pp.54-135, 2013.

O. Durand, . Jaouen, O. Soulard, L. Heuze, and . Colombet, Comparative simulations of microjetting using atomistic and continuous approaches in the presence of viscosity and surface tension, Journal of Applied Physics, vol.122, issue.13, p.135107, 2017.

M. Folk, A. Cheng, and K. Yates, HDF5: A file format and I/O library for high performance computing applications, Proceedings of supercomputing, vol.99, p.32, 1999.

J. Freire, T. Cláudio, . Silva, P. Steven, E. Callahan et al., Managing rapidly-evolving scientific workflows, International Provenance and Annotation Workshop, p.42, 2006.

Y. Fu, F. Li, F. Song, and Z. Chen, Performance analysis and optimization of in-situ integration of simulation with data analysis: zipping applications up, Proceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing, p.43, 2018.

T. Gautier, X. Besseron, and L. Pigeon, KAAPI: A thread scheduling runtime system for data flow computations on cluster of multi-processors, Proceedings of the 2007 international workshop on Parallel symbolic computation, p.28, 2007.

A. Goswami, Y. Tian, K. Schwan, F. Zheng, J. Young et al., Landrush: Rethinking in-situ analysis for GPGPU workflows, 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), pp.32-41, 2016.

D. Grewe, Z. Wang, and M. Boyle, A workload-aware mapping approach for data-parallel programs, Proceedings of the 6th International Conference on High Performance and Embedded Architectures and Compilers, p.38, 2011.

S. Habib, V. Morozov, H. Finkel, A. Pope, K. Heitmann et al., The universe at extreme scale: multi-petaflop sky simulation on the BG/Q, Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, 2012.

T. Harris, M. Maas, and V. J. Marathe, Callisto: co-scheduling parallel runtime systems, Proceedings of the Ninth European Conference on Computer Systems, p.100, 2014.

A. Heirich, E. Slaughter, M. Papadakis, W. Lee, T. Biedert et al., In situ visualization with task-based parallelism, 2017.

M. Ariza, M. Dreher, C. J. Barrios-hernandez, and B. Raan, Asynchronous In Situ Processing with Gromacs: Taking Advantage of GPUs, Latin America High Performance Computing Conference, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01180364

R. Hoque, T. Herault, G. Bosilca, and J. Dongarra, Dynamic task discovery in PaRSEC: a data-flow task-based runtime, Proceedings of the 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, p.31, 2017.

A. Hugo, A. Guermouche, R. Namyst, and P. Wacrenier, Composing multiple starpu applications over heterogeneous machines: a supervised approach, Third International Workshop on Accelerators and Hybrid Exascale Systems, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00824514

W. Humphrey, A. Dalke, and K. Schulten, Vmd: visual molecular dynamics, vol.14, pp.57-67, 1996.

A. C. Iordan, M. Jahre, and L. Natvig, Tuning the victim selection policy of Intel TBB, Journal of Systems Architecture, vol.61, issue.10, pp.584-591, 2015.

S. Joussaume, . Bellucci, R. Biercamp, . Budich, M. Dawson et al., Modelling the earth's climate system: data and computing challenges, High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion, p.32, 2012.

H. Kaiser, T. Heller, B. Adelstein-Lelbach, A. Serio, and D. Fey, HPX: A task based programming model in a global address space, Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, pp.31-153, 2014.

H. Kaiser, T. Heller, D. Bourgeois, and D. Fey, Higher-level parallelization for local and distributed asynchronous task-based programming, Proceedings of the First International Workshop on Extreme Scale Programming Models and Middleware, p.31, 2015.

J. Kress, S. Klasky, N. Podhorszki, J. Choi, H. Childs et al., Loosely Coupled In Situ Visualization: A Perspective on Why It's Here to Stay, Proceedings of the First Workshop on In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization, ISAV2015, pp.1-6, 2015.

T. Kuhlen, K. Pajarola, and . Zhou, Parallel in situ coupling of simulation with a fully featured visualization system, Proceedings of the 11th Eurographics Conference on Parallel Graphics and Visualization (EGPGV), p.34, 2011.

M. Larsen, J. Ahrens, U. Ayachit, E. Brugger, H. Childs et al., The alpine in situ infrastructure: Ascending from the ashes of strawman, Proceedings of the In Situ Infrastructures on Enabling Extreme-Scale Analysis and Visualization, p.42, 2017.

M. Larsen, E. Brugger, H. Childs, J. Eliot, K. Griffin et al., Strawman: A Batch In Situ Visualization and Analysis Infrastructure for Multi-Physics Simulation Codes, Proceedings of the First Workshop on In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization, ISAV2015, pp.30-35, 2015.

W. Lechner and C. Dellago, Accurate determination of crystal structures based on averaged local bond order parameters, The Journal of Chemical Physics, vol.129, p.136, 2008.

T. Leng, R. Ali, J. Hsieh, V. Mashayekhi, and R. Rooholamini, An empirical study of hyper-threading in high performance computing clusters, Linux HPC Revolution, vol.45, p.29, 2002.

J. Li, W. Liao, A. Choudhary, R. Ross, R. Thakur et al., Parallel netCDF: A high-performance scientific I/O interface, ACM/IEEE Conference, p.32, 2003.

M. Li, S. Sudharshan, A. R. Vazhkudai, F. Butt, X. Meng et al., Functional partitioning to optimize end-to-end performance on many-core architectures, High Performance Computing, Networking, Storage and Analysis (SC), 2010 International Conference for, p.37, 2010.

E. Liu, J. Pacitti, P. Valduriez, and M. Mattoso, A survey of data-intensive scientific workflow management, Journal of Grid Computing, vol.13, issue.4, p.42, 2015.

Q. Liu, J. Logan, Y. Tian, H. Abbasi, N. Podhorszki et al., Hello adios: The challenges and lessons of developing leadership class i/o frameworks, Concurr. Comput. : Pract. Exper, vol.26, issue.7, pp.1453-1473, 2014.

J. F. Lofstead, S. Klasky, K. Schwan, N. Podhorszki, and C. Jin, Flexible IO and Integration for Scientific Codes Through The Adaptable IO System (ADIOS), Proceedings of the 6th International Workshop on Challenges of Large Applications in Distributed Environments, CLADE '08, pp.15-24, 2008.

X. Ma, J. Lee, and M. Winslett, High-level buffering for hiding periodic output cost in scientific simulations, IEEE Transactions on Parallel and Distributed Systems, vol.17, issue.3, pp.193-204, 2006.

P. Malakar, V. Vishwanath, C. Knight, T. Munson, and M. E. Papka, Optimal execution of co-analysis for large-scale molecular dynamics simulations, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, p.43, 2016.

P. Malakar, V. Vishwanath, T. Munson, C. Knight, M. Hereld et al., Optimal scheduling of in-situ analysis for large-scale scientific simulations, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC '15, vol.52, p.43, 2015.

R. Timothy-g-mattson, V. Cledat, V. Cavé, Z. Sarkar, S. Budimlic et al., The open community runtime: A runtime system for extreme scale computing, HPEC, p.28, 2016.

M. D. McCool, A. D. Robison, and J. Reinders, Structured parallel programming: patterns for efficient computation, p.28, 2012.

N. Michaud-agrawal, E. J. Denning, T. B. Woolf, and O. Beckstein, Mdanalysis: a toolkit for the analysis of molecular dynamics simulations, Journal of computational chemistry, vol.32, issue.10, p.67, 2011.

P. Missier, S. Soiland-Reyes, S. Owen, W. Tan, A. Nenadic et al., International conference on scientific and statistical database management, p.42, 2010.

C. Mommessin, M. Dreher, B. Raffin, and T. Peterka, Automatic data filtering for in situ workflows, 2017 IEEE International Conference on, pp.43-126

O. H. Mondragon, P. G. Bridges, and T. Jones, Quantifying scheduling challenges for exascale system software, Proceedings of the 5th International Workshop on Runtime and Operating Systems for Supercomputers, p.35, 2015.

O. H. Mondragon, P. G. Bridges, S. Levy et al., Scheduling in-situ analytics in next-generation applications, Cluster, Cloud and Grid Computing (CCGrid), pp.36-77, 2016.

G. E. Moore, Cramming more components onto integrated circuits, vol.38, p.25, 1965.

R. W. Moore and B. R. Childers, Using utility prediction models to dynamically choose program thread counts, Performance Analysis of Systems and Software (ISPASS), 2012 IEEE International Symposium on, p.38, 2012.

K. Moreland, C. Sewell, W. Usher, L. Lo, J. Meredith et al., Vtk-m: Accelerating the visualization toolkit for massively threaded architectures, IEEE computer graphics and applications, vol.36, issue.3, p.42, 2016.

P. J. Mucci, S. Browne, C. Deane, and G. Ho, PAPI: A portable interface to hardware performance counters, Proceedings of the department of defense HPCMP users group conference, vol.710, p.78, 1999.

J. D. Owens, D. Luebke, N. Govindaraju, M. Harris et al., A survey of general-purpose computation on graphics hardware, Computer graphics forum, vol.26, p.27, 2007.

P. Pébaÿ and J. Bennett, An asynchronous many-task implementation of in-situ statistical analysis using legion. Sandia National Laboratories, 2015.

P. Pebay, J. C. Bennett, D. Hollman, S. Treichler, S. Patrick et al., Towards asynchronous many-task in situ data analysis using legion, Parallel and Distributed Processing Symposium Workshops, p.42, 2016.

T. Peterka, R. Ross, A. Gyulassy, V. Pascucci, W. Kendall et al., Scalable parallel building blocks for custom data analysis, IEEE Symposium on Large Data Analysis and Visualization (LDAV 2011), p.42, 2011.

N. Pineau, . Soulard, . Colombet, . Carrard, P. Pellé et al., Molecular dynamics simulations of shock compressed heterogeneous materials. ii. the graphite/diamond transition case for astrophysics applications, Journal of Applied Physics, vol.117, issue.11, p.54, 2015.

S. Plimpton, Fast parallel algorithms for short-range molecular dynamics, Journal of computational physics, vol.117, issue.1, p.74, 1995.

A. Raman, A. Zaks, J. W. Lee, and D. August, Parcae: a system for flexible parallel execution, ACM SIGPLAN Notices, vol.47, p.38, 2012.

J. Reinders, Intel threading building blocks: outfitting C++ for multi-core processor parallelism, p.28, 2007.

E. Schikuta, Message Passing Interface Forum: MPI: A message-passing interface standard, Technical report, 1994.

B. Schlegel and M. J. Frisch, Transformation between cartesian and pure spherical harmonic gaussians, International Journal of Quantum Chemistry, vol.54, issue.2, p.136, 1995.

W. J. Schroeder, B. Lorensen, and K. Martin, The visualization toolkit: an object-oriented approach to 3D graphics. Kitware, p.34, 2004.

A. Singh, P. Balaji, and W. Feng, GePSeA: a general-purpose software acceleration framework for lightweight task offloading, Parallel Processing, 2009. ICPP'09. International Conference on, p.37, 2009.

L. Soulard, Molecular dynamics study of the micro-spallation, The European Physical Journal D, vol.50, issue.3, pp.241-251, 2008.

L. Soulard, J. Pineau, L. Clérouin, and . Colombet, Molecular dynamics simulations of shock compressed heterogeneous materials. i. the porous case, Journal of Applied Physics, vol.117, issue.11, p.34, 2015.

A. Stukowski, Visualization and analysis of atomistic simulation data with ovito-the open visualization tool. Modelling and Simulation in Materials Science and Engineering, vol.18, p.57, 2009.

H. Sutter, The free lunch is over: A fundamental turn toward concurrency in software, Dr. Dobb's Journal, vol.30, p.25, 2005.

R. Thakur and W. Gropp, Test suite for evaluating performance of mpi implementations that support mpi_thread_multiple, European Parallel Virtual Machine/Message Passing Interface Users' Group Meeting, p.31, 2007.

R. Thakur, W. Gropp, and E. Lusk, On implementing mpi-io portably and with high performance, Proceedings of the sixth workshop on I/O in parallel and distributed systems, p.32, 1999.

D. Van der Spoel, E. Lindahl, B. Hess, G. Groenhof et al., GROMACS: fast, flexible, and free, Journal of computational chemistry, vol.26, issue.16, pp.1701-1718, 2005.

J. S. Vetter, Contemporary high performance computing: from Petascale toward exascale, 2013.

P. Virouleau, F. Broquedis, T. Gautier, and F. Rastello, Using data dependencies to improve task-based scheduling strategies on NUMA architectures, European Conference on Parallel Processing, pp.531-544. Springer, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01338761

V. Vishwanath, M. Hereld, and M. E. Papka, Toward simulation-time data analysis and I/O acceleration on leadership-class systems, Large Data Analysis and Visualization (LDAV), 2011 IEEE Symposium on, pp.9-14, 2011.

E. E. Zajac, Computer-made perspective movies as a scientific and communication tool, Communications of the ACM, vol.7, issue.3, p.33, 1964.

Y. Zhao, M. Hategan, B. Clifford, I. Foster, G. V. Laszewski et al., Swift: Fast, reliable, loosely coupled parallel computation, Services, p.42, 2007.

F. Zheng, H. Abbasi, C. Docan, J. Lofstead, Q. Liu et al., PreDatA - preparatory data analytics on peta-scale machines, Parallel Distributed Processing (IPDPS), pp.1-12, 2010.

F. Zheng, H. Zou, G. Eisenhauer, K. Schwan, M. Wolf et al., FlexIO: I/O middleware for location-flexible scientific data analytics, Parallel Distributed Processing (IPDPS), pp.320-331, 2013.

F. Zheng, H. Abbasi, J. Cao, J. Dayal, K. Schwan et al., In-situ I/O processing: a case for location flexibility, Proceedings of the sixth workshop on Parallel Data Storage, p.41, 2011.

F. Zheng, H. Yu, C. Hantas, M. Wolf, G. Eisenhauer et al., GoldRush: Resource efficient in situ scientific data analytics using fine-grained interference aware execution, Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, SC '13, vol.78, pp.1-78, 2013.

B. H. Zimm, The scattering of light and the radial distribution function of high polymer solutions, The Journal of Chemical Physics, vol.16, issue.12, pp.1093-1099, 1948.