HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation
Conference papers

On the efficiency of the Accelerated Processing Unit for scientific computing

Abstract : The AMD APU (Accelerated Processing Unit) architecture, which combines CPU and GPU cores on the same die at a low power budget, promises a significant advent in GPU computing, in particular to applications which performance is bottlenecked by the low PCI communication rate. However APUs are slowly heading towards a perfectly unified CPU-GPU memory space. Besides, APU integrated GPUs are also less powerful than discrete GPUs. In this paper we therefore investigate the interest of APUs for scientific computing by evaluating and comparing the performance and the power efficiency of an AMD APU (code-named Kaveri) against a discrete GPU (code-named Tahiti) and an hexa-core AMD CPU (Phenom CPU). For this purpose, we rely on two common applicative benchmarks in single precision, namely a matrix-matrix multiplication and a 3D finite difference stencil, that are optimized and tuned in OpenCL. Our results confirm that APU integrated GPUs outperform CPUs and may outperform discrete GPUs for medium-sized problems as well as for applications with high communication requirements. Moreover, our study shows that APUs can be up to 20% more power efficient than discrete GPUs.
Document type :
Conference papers
Complete list of metadata

Contributor : Pierre Fortin Connect in order to contact the contributor
Submitted on : Monday, April 25, 2016 - 11:00:27 AM
Last modification on : Monday, July 12, 2021 - 11:58:18 AM



Issam Said, Pierre Fortin, Jean-Luc Lamotte, Romain Dolbeau, Henri Calandra. On the efficiency of the Accelerated Processing Unit for scientific computing . 24th High Performance Computing Symposium (HPC 2016), Apr 2016, Pasadena, United States. pp.349-356, ⟨10.22360/SpringSim.2016.HPC.040⟩. ⟨hal-01306635⟩



Record views