Performance portability of hpc discovery science software: Fusion energy turbulence simulations at extreme scale, Supercomputing frontiers and innovations, vol.4, issue.1, 2017. ,
Investigation of supercomputer capabilities for the scalable numerical simulation of computational fluid dynamics problems in industrial applications, Computational Mathematics and Mathematical Physics, vol.56, issue.8, pp.1506-1516, 2016. ,
The bluegene/l supercomputer and quantum chromodynamics, ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC), pp.50-57, 2006. ,
Polypharmacology and supercomputer-based docking: opportunities and challenges, Molecular Simulation, vol.40, issue.10, pp.848-854, 2014. ,
Watson will see you now: a supercomputer to help clinicians make informed treatment decisions, Clinical journal of oncology nursing, vol.19, issue.1, p.31, 2015. ,
Slurm: Simple linux utility for resource management, Job Scheduling Strategies for Parallel Processing, pp.44-60, 2003. ,
Cobalt: an open source platform for hpc system software research, Edinburgh BG/L System Software Workshop, pp.803-820, 2005. ,
Torque resource manager, ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC), p.8, 2006. ,
DMTCP: Transparent checkpointing for cluster computations and the desktop, IEEE International Symposium on Parallel & Distributed Processing (IPDPS), pp.1-12, 2009. ,
Veloc: Towards high performance adaptive asynchronous checkpointing at large scale, IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp.911-920, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02184203
A higher order estimate of the optimum checkpoint interval for restart dumps, Future Generation Computer Systems, vol.22, issue.3, pp.303-312, 2006. ,
A survey on scheduling algorithms for parallel and distributed systems, Silicon Photonics & High Performance Computing, pp.51-64, 2018. ,
Parallel job scheduling policies to improve fairness: A case study, International Conference on Parallel Processing Workshops (ICPP), pp.346-353, 2010. ,
A comparative study on resource allocation and energy efficient job scheduling strategies in large-scale parallel computing systems, Cluster computing, vol.17, issue.4, pp.1349-1367, 2014. ,
Utilization, predictability, workloads, and user runtime estimates in scheduling the ibm sp2 with backfilling, IEEE Transactions on Parallel and Distributed Systems (TPDS), vol.12, issue.6, pp.529-543, 2001. ,
Fattened backfilling: An improved strategy for job scheduling in parallel systems, Journal of Parallel and Distributed Computing (JPDC), vol.97, pp.69-77, 2016. ,
Multiple-queue backfilling scheduling with priorities and reservations for parallel systems, ACM SIGMETRICS Performance Evaluation Review, vol.29, pp.72-87, 2002. ,
An efficient thread mapping strategy for multiprogramming on manycore processors, Advances in Parallel Computing, vol.25, pp.63-71, 2014. ,
Data-intensive workflow optimization based on application task graph partitioning in heterogeneous computing systems, IEEE International Conference on Big Data and Cloud Computing, pp.129-136, 2014. ,
Supporting Real-Time Jobs on the IBM Blue Gene/Q: Simulation-Based Study, Job Scheduling Strategies for Parallel Processing, pp.83-102, 2018. ,
Enabling urgent computing within the existing distributed computing infrastructure, 2011. ,
Preemption based backfill, Job Scheduling Strategies for Parallel Processing, pp.24-37, 2002. ,
A large scale study of data center network reliability, ACM Internet Measurement Conference (IMC), pp.393-407, 2018. ,
B4: Experience with a Globally Deployed Software Defined WAN, ACM SIGCOMM, 2013. ,
Spanner: Google's globally distributed database, ACM Transactions on Computer Systems (TOCS), vol.31, issue.3, 2013. ,
Global analytics in the face of bandwidth and regulatory constraints, USENIX Networked Systems Design and Implementation (NSDI), pp.323-336, 2015. ,
f4: Facebook's Warm BLOB Storage System, USENIX Operating Systems Design and Implementation (OSDI), pp.383-398, 2014. ,