Impact of vectorization and multithreading on performance and energy consumption on Jetson boards

Abstract : ARM processors are well known for their energy efficiency and are consequently widely used in embedded platforms. Like other processor architectures, they are built with different levels of parallelism, from Instruction Level Parallelism (out-of-order and superscalar capabilities) to Thread Level Parallelism (multicore), to increase their performance levels. These processors are now also targeting the HPC domain and will equip the Fujitsu Post-K supercomputer. Some ARM processors from the Cortex-A series, which equip smartphones and tablets, also provide Data Level Parallelism through SIMD units called NEON. These units are able to process 128 bits of data at a time, for example four 32-bit floating-point values. Taking advantage of these units requires code vectorization, which may be performed automatically by the compiler or explicitly by using NEON intrinsics. Exploiting all these levels of parallelism may lead to better performance but also to higher energy consumption. This is not an issue in the HPC domain, where application development is driven by finding the best performance. However, development for embedded applications is driven by finding the best trade-off between energy consumption and performance. In this paper, we propose to study the impact of vectorization and multithreading on both performance and energy consumption on some Nvidia Jetson boards. Results show that, depending on the algorithm and on its implementation, vectorization may bring a speedup similar to that of an OpenMP scalar implementation but with lower energy consumption. Moreover, combining vectorization and multithreading may come close to both the best performance level and the lowest energy consumption, though not when the cores run at their maximum frequencies.
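To make the levels of parallelism named in the abstract concrete, here is a minimal sketch in C of an element-wise addition of two float arrays. It is not taken from the paper: the function names add_scalar and add_simd_omp, and the choice of kernel, are assumptions made for illustration. On ARM targets that define __ARM_NEON, the loop uses NEON intrinsics to process four 32-bit floats per 128-bit register. On other targets it falls back to a plain loop that the compiler may auto-vectorize. The OpenMP pragma distributes iterations across cores when the code is built with OpenMP support (for example -fopenmp with GCC) and is ignored otherwise.

```c
#include <assert.h>
#include <stddef.h>

#if defined(__ARM_NEON)
#include <arm_neon.h>   /* NEON intrinsics, only available on ARM targets */
#endif

/* Scalar reference version: out[i] = a[i] + b[i]. */
void add_scalar(const float *a, const float *b, float *out, size_t n)
{
    assert(a && b && out);
    for (size_t i = 0; i < n; ++i)
        out[i] = a[i] + b[i];
}

/* Hypothetical combined version: explicit SIMD (when NEON is available)
 * plus multithreading through OpenMP. */
void add_simd_omp(const float *a, const float *b, float *out, size_t n)
{
    assert(a && b && out);
    /* Largest multiple of 4 not exceeding n: the part handled 4 floats at a time. */
    long vec_end = (long)(n - (n % 4));

#if defined(__ARM_NEON)
    /* Each iteration loads, adds, and stores four 32-bit floats (128 bits). */
    #pragma omp parallel for
    for (long i = 0; i < vec_end; i += 4) {
        float32x4_t va = vld1q_f32(a + i);
        float32x4_t vb = vld1q_f32(b + i);
        vst1q_f32(out + i, vaddq_f32(va, vb));
    }
#else
    /* Fallback without NEON: plain loop, left to the compiler's auto-vectorizer. */
    #pragma omp parallel for
    for (long i = 0; i < vec_end; ++i)
        out[i] = a[i] + b[i];
#endif

    /* Scalar tail for the remaining n % 4 elements. */
    for (size_t i = (size_t)vec_end; i < n; ++i)
        out[i] = a[i] + b[i];
}
```

Both functions compute the same result, so a study along the lines described in the abstract would compare variants like these on execution time and energy drawn, not on output.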

https://hal-brgm.archives-ouvertes.fr/hal-01795146
Contributor : Gauthier Sornet
Submitted on : Tuesday, August 21, 2018 - 11:25:28 AM
Last modification on : Tuesday, April 2, 2019 - 1:39:34 AM
Document(s) archived on : Thursday, November 22, 2018 - 1:13:25 PM

File

main-IEEE.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01795146, version 2

Citation

Sylvain Jubertie, Emmanuel Melin, Naly Raliravaka, Emmanuel Bodèle, Pablo Escot Bocanegra. Impact of vectorization and multithreading on performance and energy consumption on Jetson boards. The 2018 International Conference on High Performance Computing & Simulation (HPCS 2018) - HPCS 2018, Jul 2018, Orléans, France. ⟨hal-01795146v2⟩
