K. Bergman, S. Borkar, D. Campbell, W. Carlson, W. Dally et al., Exascale computing study: Technology challenges in achieving exascale systems, 2008.

X. Yang, Z. Zhou, S. Wallace, Z. Lan, W. Tang et al., Integrating dynamic pricing of electricity into energy aware scheduling for hpc systems Networking, Storage and Analysis, ser. SC '13, Proceedings of the International Conference on High Performance Computing, pp.1-6011, 2013.

M. A. Bari, N. Chaimov, A. M. Malik, K. A. Huck, B. Chapman et al., ARCS: Adaptive Runtime Configuration Selection for Power-Constrained OpenMP Applications, 2016 IEEE International Conference on Cluster Computing (CLUSTER), pp.461-470, 2016.
DOI : 10.1109/CLUSTER.2016.39

A. K. Porterfield, S. L. Olivier, S. Bhalachandra, and J. F. Prins, Power Measurement and Concurrency Throttling for Energy Reduction in OpenMP Programs, 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum, pp.884-891, 2013.
DOI : 10.1109/IPDPSW.2013.15

A. Nandamuri, A. M. Malik, A. Qawasmeh, and B. M. Chapman, Power and Energy Footprint of OpenMP Programs Using OpenMP Runtime API, 2014 Energy Efficient Supercomputing Workshop, pp.79-88, 2014.
DOI : 10.1109/E2SC.2014.11

C. Su, D. Li, D. S. Nikolopoulos, K. W. Cameron, B. R. Supinski et al., Model-based, memory-centric performance and power optimization on NUMA multiprocessors, 2012 IEEE International Symposium on Workload Characterization (IISWC), pp.164-173, 2012.
DOI : 10.1109/IISWC.2012.6402921

URL : http://scape.cs.vt.edu/wp-content/uploads/2014/03/iiswc12.pdf

C. Lively, X. Wu, V. Taylor, S. Moore, H. Chang et al., Energy and performance characteristics of different parallel implementations of scientific applications on multicore systems, The International Journal of High Performance Computing Applications, vol.10, issue.10, pp.342-350, 2011.
DOI : 10.1145/1964218.1964228

D. Li, B. R. De-supinski, M. Schulz, K. Cameron, and D. S. Nikolopoulos, Hybrid MPI/OpenMP power-aware computing, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS), pp.1-12, 2010.
DOI : 10.1109/IPDPS.2010.5470463

URL : http://scape.cs.vt.edu/wp-content/uploads/2012/08/ipdps10_hybrid.pdf

P. Virouleau, P. Brunet, F. Broquedis, N. Furmento, S. Thibault et al., Evaluation of OpenMP Dependent Tasks with the KASTORS Benchmark Suite, 10th International Workshop on OpenMP, ser. IWOMP'14, pp.16-29, 2014.
DOI : 10.1007/978-3-319-11454-5_2

URL : https://hal.archives-ouvertes.fr/hal-01081974

A. Yarkhan, J. Kurzak, P. Luszczek, and J. Dongarra, Porting the PLASMA Numerical Library to the OpenMP Standard, International Journal of Parallel Programming, vol.37, issue.9, pp.1-22, 2016.
DOI : 10.1109/SERVICES.2007.63

M. Curtis-maury, J. Dzierwa, C. D. Antonopoulos, and D. S. Nikolopoulos, Online strategies for high-performance power-aware thread execution on emerging multiprocessors, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium, 2006.
DOI : 10.1109/IPDPS.2006.1639598

URL : http://www.cs.wm.edu/~dsn/papers/HPPAC_2006.pdf

A. K. Porterfield, S. L. Olivier, S. Bhalachandra, and J. F. Prins, Power Measurement and Concurrency Throttling for Energy Reduction in OpenMP Programs, 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum, pp.884-891, 2013.
DOI : 10.1109/IPDPSW.2013.15

A. Duran, X. Teruel, R. Ferrer, X. Martorell, and E. Ayguade, Barcelona OpenMP Tasks Suite: A Set of Benchmarks Targeting the Exploitation of Task Parallelism in OpenMP, 2009 International Conference on Parallel Processing, pp.124-131, 2009.
DOI : 10.1109/ICPP.2009.64

M. Frigo, C. E. Leiserson, and K. H. Randall, The implementation of the Cilk-5 multithreaded language, ACM SIGPLAN Notices, vol.33, issue.5, pp.212-223, 1998.
DOI : 10.1145/277652.277725

T. Gautier, J. V. Lima, N. Maillard, and B. Raffin, XKaapi: A Runtime System for Data-Flow Task Programming on Heterogeneous Architectures, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing, pp.1299-1308, 2013.
DOI : 10.1109/IPDPS.2013.66

URL : https://hal.archives-ouvertes.fr/hal-00799904

F. Broquedis, T. Gautier, and V. Danjean, libKOMP, an Efficient OpenMP Runtime System for Both Fork-Join and Data Flow Paradigms, Proceedings of the 8th International Conference on OpenMP in a Heterogeneous World, ser. IWOMP'12, pp.102-115, 2012.
DOI : 10.1007/978-3-642-30961-8_8

URL : https://hal.archives-ouvertes.fr/hal-00796253

M. Tchiboukdjian, N. Gast, and D. Trystram, Decentralized list scheduling, Annals of Operations Research, vol.18, issue.2, pp.237-259, 2013.
DOI : 10.1007/978-3-642-17514-5_25

URL : https://hal.archives-ouvertes.fr/hal-00796248

P. Virouleau, F. Broquedis, T. Gautier, and F. Rastello, Using Data Dependencies to Improve Task-Based Scheduling Strategies on NUMA Architectures, Proceedings of the 22Nd International Conference on Euro-Par 2016: Parallel Processing, pp.531-544
DOI : 10.1007/978-3-319-11454-5_2

URL : https://hal.archives-ouvertes.fr/hal-01338761

T. Gautier and P. Virouleau, New libkomp library, 2015.

J. Treibig, G. Hager, and G. Wellein, LIKWID: Lightweight Performance Tools, 1104.
DOI : 10.1007/978-3-642-24025-6_14

URL : http://arxiv.org/pdf/1104.4874

H. Ribic and Y. D. Liu, Energy-efficient work-stealing language runtimes, Proceedings of the 19th international conference on Architectural support for programming languages and operating systems, ASPLOS '14, pp.513-528, 2014.
DOI : 10.1145/2541940.2541971

URL : http://www.cs.binghamton.edu/~davidl/papers/hermes.pdf