Enabling and scaling biomolecular simulations of 100 million atoms on petascale machines with a multicoreoptimized message-driven runtime, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC), vol.61, p.11, 2011. ,
Ale3d: An arbitrary lagrangian-eulerian multiphysics code, vol.5, p.2017 ,
On the merits of distributed work-stealing on selective locality-aware tasks, Proceedings of International Conference on Parallel Processing (ICPP) ,
, , pp.100-109, 2013.
Dense matrix computations on numa architectures with distance-aware work stealing, J. Supercomputing Frontiers and Innovations (JSFI), vol.2, issue.1, 2015. ,
Quantifying the energy efficiency challenges of achieving exascale computing, Proceedings of International Symposium on Cluster, Cloud and Grid Computing (CCGrid), 2015. ,
Thermal aware automated load balancing for hpc applications, 2013 IEEE International Conference on Cluster Computing (CLUS-TER), pp.1-8, 2013. ,
strong " np-completeness results: Motivation, examples, and implications, J. ACM, vol.25, issue.3, pp.499-508, 1978. ,
Complexity of machine scheduling problems, Studies in Integer Programming, ser. Annals of Discrete Mathematics, vol.1, pp.343-362, 1977. ,
Automated load balancing invocation based on application characteristics, International Conference on Cluster Computing (CLUSTER), pp.373-381, 2012. ,
Quantifying the effectiveness of load balance algorithms, International Conference on Supercomputing (ICS), pp.185-194, 2012. ,
Optimization and approximation in deterministic sequencing and scheduling: a survey, Discrete Optimization II, ser. Annals of Discrete Mathematics, vol.5, pp.287-326, 1979. ,
Hypergraph-based dynamic load balancing for adaptive scientific computations, Proceedings of International Parallel and Distributed Processing Symposium (IPDPS), 2007. ,
Applying graph partitioning methods in measurement-based dynamic load balancing, 2012. ,
A hierarchical approach for load balancing on parallel multi-core systems, Proceedings of International Conference on Parallel Processing (ICPP), pp.118-127, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00788012
Fast and high quality topology-aware task mapping, Proceedings of International Parallel and Distributed Processing Symposium (IPDPS), 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01159677
Communication and topology-aware load balancing in charm++ with treematch, International Conference on Cluster Computing (CLUSTER), pp.1-8, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00851148
Strategies for dynamic load balancing on highly parallel computers, IEEE Transactions on Parallel and Distributed Systems (TPDS), vol.4, issue.9, 1993. ,
A distributed dynamic load balancer for iterative applications, Proceedings of International Conference for High Performance Computing, Networking, Storage and Analysis (SC) ,
, , vol.15, p.11, 2013.
A batch task migration approach for decentralized global rescheduling, Proceedings of International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), pp.49-56, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01860626
Scheduling multithreaded computations by work stealing, J. ACM, vol.46, issue.5, pp.720-748, 1999. ,
Scheduling parallel computations by work stealing: A survey, International Journal of Parallel Programming (IJPP), vol.46, issue.2, pp.173-197, 2018. ,
Distributed selfish load balancing, SIAM Journal on Computing, vol.37, issue.4, pp.1163-1181, 2007. ,
Tight & simple load balancing, Proceedings of International Conference on Parallel and Distributed Computing (IPDPS), pp.718-726, 2019. ,
The potential of diffusive load balancing at large scale, Proceedings of European MPI Users' Group Meeting (EuroMPI), pp.154-157, 2016. ,
How to be a successful thief, Proceedings of European Conference on Parallel Processing ,
Contention and locality-aware workstealing for iterative applications in multi-socket computers, IEEE Transactions on Computers, vol.67, issue.6, pp.784-798, 2018. ,
Migpf: Towards on self-organizing process rescheduling of bulksynchronous parallel applications, Future Generation Computer Systems, vol.78, pp.272-286, 2018. ,
Work Stealing and Persistence-based Load Balancers for Iterative Overdecomposed Applications, Proceedings of International Symposium on High-Performance Parallel and Distributed Computing (HPDC), pp.137-148, 2012. ,
Dynamic tracing: Memoization of task graphs for dynamic task-based runtimes, Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, ser. SC, vol.34, pp.1-34, 2018. ,
Massively parallel chess, Proceedings of the DIMACS Parallel Implementation Challenge, 1994. ,
A mean field model of work stealing in large-scale systems, ACM SIGMETRICS Performance Evaluation Review (PER), vol.38, issue.1, pp.13-24, 2010. ,
URL : https://hal.archives-ouvertes.fr/hal-00788862
Periodic hierarchical load balancing for large supercomputers, International Journal of High Performance Computing Applications (IJHPCA), vol.25, pp.371-385, 2011. ,
Pt-scotch: A tool for efficient parallel graph ordering, Parallel computing, vol.34, issue.6-8, pp.318-331, 2008. ,
URL : https://hal.archives-ouvertes.fr/hal-00402893
Multi-threaded graph partitioning, Proceedings of International Symposium on Parallel and Distributed Processing (IPDPS), pp.225-236, 2013. ,
Improving the memory access locality of hybrid MPI applications, Proceedings of European MPI Users' Group Meeting (EuroMPI), vol.11, pp.1-11, 2017. ,
Trends in data locality abstractions for HPC systems, IEEE Transactions on Parallel and Distributed Systems (TPDS), vol.28, issue.10, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01621371
A topology-aware load balancing algorithm for clustered hierarchical multi-core machines, Future Generation Computer Systems (FGCS), vol.30, pp.191-201, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-00953132
A comprehensive performance evaluation of the BinLPT workload-aware loop scheduler, Concurrency and Computation: Practice and Experience, p.5170, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-01986361
Reducing fragmentation on 3d torus-based hpc systems using packing-based job scheduling and job placement reconfiguration, Proceedings of International Symposium on Parallel and Distributed Computing, pp.34-43, 2017. ,
Optimizing data locality for fork/join programs using constrained work stealing, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC), pp.857-868, 2014. ,
A work stealing scheduler for parallel loops on shared cache multicores, Proceedings of European Conference on Parallel Processing Workshops (EuroParW), pp.99-107, 2010. ,
Almost deterministic work stealing, Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis (SC), 2019. ,
, Distributed computing: principles, algorithms, and systems, 2011.
, Scheduling Algorithms, 2001.
Epidemic algorithms for replicated database maintenance, Proceedings of Symposium on Principles of Distributed Computing (PODC), 1987. ,
Distributed quiescence detection in multiagent negotiation, Proceedings International Conference on MultiAgent Systems, pp.317-324, 2000. ,
Parallel Programming with Migratable Objects: Charm++ in Practice, Proceedings of International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2014. ,
A taxonomy of task-based parallel programming technologies for high-performance computing, Springer Journal of Supercomputing, vol.74, issue.4, pp.1422-1434, 2018. ,
Namd: Biomolecular simulation on thousands of processors, SC '02: Proceedings of the 2002 ACM/IEEE Conference on Supercomputing, pp.36-36, 2002. ,
Improved analysis of deterministic load-balancing schemes, ACM Trans. Algorithms (TALG), vol.15, issue.1, pp.1-10, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01251847