E. Strohmaier, The top500 benchmark (2017) https://www

. Intel, Intel Xeon Phi Processor: Performance Monitoring Reference Manual ?, 2017.

J. Dongarra, Report on the sunway taihulight system, 2016.

P. Computing, Zettascaler-2.0 configurable liquid immersion cooling system, 2017.

R. Asai, Clustering Modes in Knights Landing Processors: Developer's Guide, Colfax International, 2016.

A. Vladimirov and R. Asai, MCDRAM as High-Bandwith Memory (HBM) in Knights Landing Processors: Developer's Guide, 2016.

D. Unat, J. Shalf, T. Hoefler, and T. A. Schulthess, Programming Abstractions for Data Locality, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01083080

I. Corporation, Memkind: A user extensible heap manager, 2018.

L. A. Laboratory, Simplified interface to complex memory. https: //github, 2017.

L. Oden and P. Balaji, Hexe: A toolkit for heterogeneous memory management, IEEE International Conference on Parallel and Distributed Systems (ICPADS), 2017.

G. Voskuilen, A. F. Rodrigues, and S. D. Hammond, Analyzing allocation behavior for multi-level memory, Proceedings of the Second International Symposium on Memory Systems, MEMSYS '16, pp.204-207, 2016.
DOI : 10.1145/2818950.2818978

H. Servat, A. J. Peña, G. Llort, E. Mercadal, H. Hoppe et al., Automating the Application Data Placement in Hybrid Memory Systems, 2017 IEEE International Conference on Cluster Computing (CLUSTER), pp.126-136, 2017.
DOI : 10.1109/CLUSTER.2017.50

S. Perarnau, J. A. Zounmevo, B. Gerofi, K. Iskra, and P. Beckman, Exploring Data Migration for Future Deep-Memory Many-Core Systems, 2016 IEEE International Conference on Cluster Computing (CLUSTER), 2016.
DOI : 10.1109/CLUSTER.2016.42

K. Chandrasekar, X. Ni, and L. V. Kalé, A memory heterogeneity-aware runtime system for bandwidth-sensitive HPC applications, 2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), pp.1293-1300, 2017.
DOI : 10.1109/IPDPSW.2017.168

C. Augonnet, S. Thibault, R. Namyst, and P. A. Wacrenier, StarPU: a unified platform for task scheduling on heterogeneous multicore architectures, Concurrency and Computation: Practice and Experience, vol.23, issue.4, pp.187-198, 2011.
DOI : 10.1002/cpe.1631

URL : https://hal.archives-ouvertes.fr/inria-00550877

T. Gautier, J. V. Lima, N. Maillard, and B. Raffin, XKaapi: A Runtime System for Data-Flow Task Programming on Heterogeneous Architectures, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing, pp.1299-1308, 2013.
DOI : 10.1109/IPDPS.2013.66

URL : https://hal.archives-ouvertes.fr/hal-00799904

C. Augonnet, J. Clet-ortega, S. Thibault, and R. Namyst, Data-Aware Task Scheduling on Multi-accelerator Based Platforms, 2010 IEEE 16th International Conference on Parallel and Distributed Systems, pp.291-298, 2010.
DOI : 10.1109/ICPADS.2010.129

URL : https://hal.archives-ouvertes.fr/inria-00523937

M. A. Aba, L. Zaourar, and A. Munier, Approximation algorithm for scheduling a chain of tasks on heterogeneous systems, European Conference on Parallel Processing, pp.353-365, 2017.

R. Landaverde, T. Zhang, A. K. Coskun, and M. Herbordt, An investigation of Unified Memory Access performance in CUDA, 2014 IEEE High Performance Extreme Computing Conference (HPEC), pp.1-6, 2014.
DOI : 10.1109/HPEC.2014.7040988

M. R. Garey and D. S. Johnson, Computers and Intractability, a Guide to the Theory of NP-Completeness, 1979.

G. D. Micheli, Synthesis and Optimization of Digital Circuits, 1994.

H. Topcuoglu, S. Hariri, and M. Y. Wu, Performance-effective and low-complexity task scheduling for heterogeneous computing, IEEE Transactions on Parallel and Distributed Systems, vol.13, issue.3, pp.260-274, 2002.
DOI : 10.1109/71.993206

URL : http://meseec.ce.rit.edu/eecc722-fall2002/papers/hc/5/l0260.pdf

A. Benoit, S. Perarnau, L. Pottier, and Y. Robert, A performance model to execute workflows on high-bandwidth memory architectures, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01767888

T. Tobita and H. Kasahara, A standard task graph set for fair evaluation of multiprocessor scheduling algorithms, Journal of Scheduling, vol.70, issue.5, pp.379-394, 2002.
DOI : 10.1109/TC.1973.5009153