P. Kogge, ExaScale Computing Study: Technology Challenges in Achieving Exascale Systems, DARPA IPTO, 2008.

T. List, , 2019.

, Green500 List, 2020.

S. Hong and H. Kim, An integrated GPU power and performance model, ACM SIGARCH Computer Architecture News, vol.38, issue.3, 2012.

J. Lucas, S. Lal, M. Andersch, M. Alvarez-mesa, and B. Juurlink, How a single chip causes massive power bills GPUSimPow: A GPGPU power simulator, ISPASS 2013 -IEEE International Symposium on Performance Analysis of Systems and Software, pp.97-106

J. Leng, GPU wattch ?: Enabling energy optimizations in GPG-PUs, Proceedings -International Symposium on Computer Architecture, 2013.

S. Che, Rodinia: A benchmark suite for heterogeneous computing, IEEE International Symposium on Workload Characterization (IISWC), pp.44-54, 2009.

R. A. Bridges, N. Imam, and T. M. Mintz, Understanding GPU Power : A Survey of Profiling, Modeling and Simulation Methods, ACM Comput. Surv, vol.49, p.27, 2016.

X. Ma, L. Zhong, and Z. Deng, Statistical Power Consumption Analysis and Modeling for GPU-based Computing, Proceedings of the SOSP Workshop on Power Aware Computing and Systems (HotPower '09

H. Nagasaka, N. Maruyama, A. Nukada, T. Endo, and S. Matsuoka, Statistical power modeling of GPU kernels using performance counters, International Conference on Green Computing, 2010.

J. Chen, B. Li, Y. Zhang, L. Peng, and J. K. Peir, Statistical GPU power analysis using tree-based methods, International Green Computing Conference and Workshops, 2011.

S. Song, C. Su, B. Rountree, and K. W. Cameron, A simplified and accurate model of power-performance efficiency on emergent GPU architectures, Proceedings -IEEE 27th International Parallel and Distributed Processing Symposium, IPDPS 2013, pp.673-686

W. Jia, E. Garza, K. A. Shaw, and M. Martonosi, GPU Performance and Power Tuning Using Regression Trees, ACM Transactions on Architecture and Code Optimization, vol.12, issue.2, 2015.

K. Kasichayanula, Power aware computing on GPUs. Symposium on Application Accelerators in High-Performance Computing, pp.64-73, 2012.

J. Coplin and M. Burtscher, Power Characteristics of Irregular GPGPU Programs, Workshop on General Purpose Processing Using GPUs, 2014.

H. Wang and Q. Chen, Power Estimating Model and Analysis of General Programming on GPU, vol.7, pp.1164-1170, 2012.

I. Zecena, Energy consumption analysis of parallel sorting algorithms running on multicore systems, International Green Computing Conference (IGCC), pp.1-6, 2012.

M. J. Ikram, O. A. Abulnaja, M. E. Saleh, and M. A. Al-hashimi, Measuring power and energy consumption of programs running on kepler GPUs, 2017 Intl Conf on Advanced Control Circuits Systems (ACCS) Systems and 2017 Intl Conf on New Paradigms in Electronics and Information Technology (PEIT), pp.18-25, 2017.

. Simgrid, , 2020.

S. Williams, A. Waterman, and D. Patterson, Roofline: an insightful visual performance model for multicore architectures, Commun. ACM, vol.52, pp.65-76, 2009.

J. W. Choi, D. Bedard, R. Fowler, and R. Vuduc, A Roofline Model of Energy, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing, pp.661-672, 2013.

A. Lopes, F. Pratas, L. Sousa, and A. Ilic, Exploring GPU performance, power and energy-efficiency bounds with Cache-aware Roofline Modeling, 2017 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp.259-268, 2017.

M. Ghane, J. M. Larkin, L. Shi, S. Chandrasekaran, and M. S. Cheung, Power and Energy-efficiency Roofline Model for GPUs, ArXiv, 2018.

J. W. Sheaffer, K. Skadron, and D. P. Luebke, Fine-grained graphics architectural simulation with Qsilver, ACM SIGGRAPH 2005 Posters (SIGGRAPH '05)

A. Bakhoda, G. L. Yuan, W. W. Fung, H. Wong, and T. M. Aamodt, Analyzing CUDA workloads using a detailed GPU simulator, ISPASS 2009 -International Symposium on Performance Analysis of Systems and Software

J. Zhing and B. He, Kernelet: High-Throughput GPU Kernel Executions with Dynamic Slicing and Scheduling, IEEE Trans. Parallel Distrib, vol.25, issue.6, pp.1522-1532, 2014.

J. Lim, Power Modeling for GPU Architectures Using McPAT, ACM Trans. Des. Autom. Electron. Syst, vol.19, issue.26, p.24, 2014.

S. Li, McPAT: An integrated power, area, and timing modeling framework for multicore and manycore architectures, 42nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), pp.469-480, 2009.