, Datashader

. Scheddisplay,

. Schedlog,

E. Altman, M. Arnold, S. Fink, M. , and N. , Performance analysis of idle programs, Proceedings of the ACM International Conference on Object Oriented Programming Systems Languages and Applications, pp.739-753, 2010.

T. E. Anderson, B. N. Bershad, E. D. Lazowska, and H. M. Levy, Scheduler activations: Effective kernel support for the user-level management of parallelism, ACM Transactions on Computer Systems (TOCS), vol.10, pp.53-79, 1992.

S. Bird, L. Canavan, C. Hulsey, M. P. Rudiger, B. V. De-ven et al., , 2013.

J. Bouron, S. Chevalley, B. Lepers, W. Zwaenepoel, R. Gouicem et al., The battle of the schedulers: FreeBSD ULE vs, USENIX Annual Technical Conference, USENIX ATC 2018, pp.85-96, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01853267

S. Boyd-wickizer, A. T. Clements, Y. Mao, A. Pesterev, M. F. Kaashoek et al., An analysis of Linux scalability to many cores, OSDI, 2010.

T. Brecht, On the importance of parallel application placement in NUMA Multiprocessors, USENIX SEDMS, 1993.

A. Chanda, A. L. Cox, and W. Zwaenepoel, Whodunit: Transactional profiling for multi-tier applications, EuroSys, pp.17-30, 2007.

T. Chen, L. I. Ananiev, and A. V. Tikhonov, Keeping kernel performance from regressions, Linux Symposium, vol.1, pp.93-102, 2007.

J. Corbet, TurboSched: the return of small-task packing, Linux Weekly News, 2019.

M. Dashti, A. Fedorova, J. Funston, F. Gaud, R. Lachaize et al., Traffic management: a holistic approach to memory placement on NUMA systems, ASPLOS, pp.381-394, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00945758

A. C. De-melo, Performance counters on Linux, Linux Plumbers Conference, vol.118, 2009.

A. C. De-melo, The new Linux 'perf' tools, Slides from Linux Kongress, vol.18, 2010.

K. Fürlinger and M. Gerndt, ompP: A profiling tool for OpenMP, International Workshop on OpenMP, pp.15-23, 2005.

A. S. Harji, P. A. Buhr, and T. Brecht, Our troubles with Linux and why you should care, Proceedings of the Second Asia-Pacific Workshop on Systems, vol.2, pp.1-2, 2011.

N. Joukov, A. Traeger, R. Iyer, C. P. Wright, and E. Zadok, Operating system profiling via latency analysis, Proceedings of the 7th symposium on Operating systems design and implementation, pp.89-102, 2006.

B. Lepers, V. Quéma, and A. Fedorova, Thread and memory placement on NUMA systems: asymmetry matters, USENIX ATC, pp.277-289, 2015.

J. Lozi, F. David, G. Thomas, J. Lawall, and G. Muller, Remote core locking: Migrating critical-section execution to improve the performance of multithreaded applications, Presented as part of the 2012 USENIX Annual Technical Conference, vol.12, pp.65-76, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00779908

J. Lozi, B. Lepers, J. Funston, F. Gaud, V. Quéma et al., The Linux scheduler: a decade of wasted cores, EuroSys, vol.1, p.16, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01295194

M. S. Mollison, B. Brandenburg, A. , and J. H. , Towards unit testing real-time schedulers in LITMUS RT, Proceedings of the 5th Workshop on Operating Systems Platforms for Embedded Real-Time Applications, 2009.

S. Panneerselvam and M. Swift, Rinnegan: Efficient resource use in heterogeneous architectures, Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, pp.373-386, 2016.

, Linux profiling with performance counters

S. E. Perl and W. E. Weihl, Performance assertion checking, SOSP, pp.134-145, 1993.

P. Shah, TurboSched: A scheduler for sustaining turbo frequencies for longer durations, 2019.

K. Shen, M. Zhong, L. , and C. , I/O system performance debugging using model-driven anomaly characterization, FAST, pp.309-322, 2005.

D. Tam, R. Azimi, and M. Stumm, Thread clustering: sharing-aware scheduling on SMP-CMP-SMT multiprocessors, EuroSys, pp.47-58, 2007.

L. Tang, J. Mars, X. Zhang, R. Hagmann, R. Hundt et al., Optimizing Google's warehouse scale computers: The NUMA experience, High Performance Computer Architecture (HPCA2013), pp.188-197, 2013.

P. Tseng, P. Hsiu, C. Pan, and T. Kuo, User-centric energy-efficient scheduling on multi-core mobile devices, Proceedings of the 51st Annual Design Automation Conference, pp.1-6, 2014.

A. Verma, L. Pedrosa, M. Korupolu, D. Oppenheimer, E. Tune et al., Large-scale cluster management at Google with Borg, Proceedings of the Tenth European Conference on Computer Systems, p.18, 2015.

V. M. Weaver, Linux perf_event features and overhead, The 2nd International Workshop on Performance Analysis of Workload Optimized Systems, vol.13, 2013.

K. Yu, D. Han, C. Youn, S. Hwang, and J. Lee, Power-aware task scheduling for big.LITTLE mobile processor, International SoC Design Conference (ISOCC), pp.208-212, 2013.

X. Zhang, S. Dwarkadas, and R. Zhong, An evaluation of perchip nonuniform frequency scaling on multicores, USENIX ATC, 2010.

S. Zhuravlev, J. C. Saez, S. Blagodurov, A. Fedorova, and M. Prieto, Survey of scheduling techniques for addressing shared resources in multicore processors, ACM Computing Surveys (CSUR), vol.45, p.4, 2012.