The TOP500 benchmark, 2017. https://www
Intel Xeon Phi Processor: Performance Monitoring Reference Manual, 2017.
Report on the Sunway TaihuLight system, 2016.
ZettaScaler-2.0 configurable liquid immersion cooling system, 2017.
Clustering Modes in Knights Landing Processors: Developer's Guide, Colfax International, 2016.
MCDRAM as High-Bandwidth Memory (HBM) in Knights Landing Processors: Developer's Guide, 2016.
Programming Abstractions for Data Locality, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01083080
Memkind: A user extensible heap manager, 2018.
Simplified interface to complex memory. https://github, 2017.
Hexe: A toolkit for heterogeneous memory management, IEEE International Conference on Parallel and Distributed Systems (ICPADS), 2017.
Analyzing allocation behavior for multi-level memory, Proceedings of the Second International Symposium on Memory Systems, MEMSYS '16, pp.204-207, 2016.
DOI : 10.1145/2818950.2818978
Automating the Application Data Placement in Hybrid Memory Systems, 2017 IEEE International Conference on Cluster Computing (CLUSTER), pp.126-136, 2017.
DOI : 10.1109/CLUSTER.2017.50
Exploring Data Migration for Future Deep-Memory Many-Core Systems, 2016 IEEE International Conference on Cluster Computing (CLUSTER), 2016.
DOI : 10.1109/CLUSTER.2016.42
A memory heterogeneity-aware runtime system for bandwidth-sensitive HPC applications, 2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), pp.1293-1300, 2017.
DOI : 10.1109/IPDPSW.2017.168
StarPU: a unified platform for task scheduling on heterogeneous multicore architectures, Concurrency and Computation: Practice and Experience, vol.23, issue.4, pp.187-198, 2011.
DOI : 10.1002/cpe.1631
URL : https://hal.archives-ouvertes.fr/inria-00550877
XKaapi: A Runtime System for Data-Flow Task Programming on Heterogeneous Architectures, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing, pp.1299-1308, 2013.
DOI : 10.1109/IPDPS.2013.66
URL : https://hal.archives-ouvertes.fr/hal-00799904
Data-Aware Task Scheduling on Multi-accelerator Based Platforms, 2010 IEEE 16th International Conference on Parallel and Distributed Systems, pp.291-298, 2010.
DOI : 10.1109/ICPADS.2010.129
URL : https://hal.archives-ouvertes.fr/inria-00523937
Approximation algorithm for scheduling a chain of tasks on heterogeneous systems, European Conference on Parallel Processing, pp.353-365, 2017.
An investigation of Unified Memory Access performance in CUDA, 2014 IEEE High Performance Extreme Computing Conference (HPEC), pp.1-6, 2014.
DOI : 10.1109/HPEC.2014.7040988
Computers and Intractability: A Guide to the Theory of NP-Completeness, 1979.
Synthesis and Optimization of Digital Circuits, 1994.
Performance-effective and low-complexity task scheduling for heterogeneous computing, IEEE Transactions on Parallel and Distributed Systems, vol.13, issue.3, pp.260-274, 2002.
DOI : 10.1109/71.993206
URL : http://meseec.ce.rit.edu/eecc722-fall2002/papers/hc/5/l0260.pdf
A performance model to execute workflows on high-bandwidth memory architectures, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01767888
A standard task graph set for fair evaluation of multiprocessor scheduling algorithms, Journal of Scheduling, vol.5, issue.5, pp.379-394, 2002.
DOI : 10.1002/jos.116