Conference paper, Year: 2021

OpenCL FPGA Optimization guided by memory accesses and roofline model analysis applied to tomography acceleration

Daouda Diakite
Nicolas Gac
Maxime Martelli

Abstract

Backward projection is one of the most time-consuming steps in method-based iterative reconstruction for computed tomography. The memory access pattern of 3D backprojection is potentially regular enough to efficiently exploit the computational power of GPU- or FPGA-based acceleration boards. High-level tools such as HLS or OpenCL make it easier to account for such particular memory accesses during the design flow without specific hardware IPs. This paper proposes an OpenCL acceleration of the voxel-driven 3D backprojection algorithm on an Arria 10 FPGA. The design flow starts with an offline memory access analysis, then proceeds iteratively with a performance analysis of each new implementation plotted on a Berkeley Roofline model. By taking advantage of the FPGA's local memory architecture, we succeeded in designing an efficient pipeline that reaches maximum bandwidth with stall-free accesses, underlining this platform's interest for memory optimization. Our design flow significantly improved the computational intensity of our initial algorithm, resulting in better performance on the FPGA. It reaches performance comparable to an embedded GPU implementation and to other computed tomography algorithms on FPGAs.
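The roofline analysis mentioned in the abstract bounds the attainable performance of a kernel by the lower of the compute roof and the memory roof (bandwidth multiplied by computational intensity). The following is a minimal sketch of that bound, not the authors' tooling. The function names and the platform figures (`PEAK_GFLOPS`, `BANDWIDTH_GBS`) are hypothetical placeholders chosen for illustration, not measurements of the Arria 10 board or results from the paper.

```python
def attainable_gflops(peak_gflops, bandwidth_gbs, intensity_flop_per_byte):
    """Roofline bound: the lower of the compute roof and the memory roof."""
    memory_roof = bandwidth_gbs * intensity_flop_per_byte
    return min(peak_gflops, memory_roof)


def ridge_point(peak_gflops, bandwidth_gbs):
    """Intensity (FLOP/byte) at which the memory roof meets the compute roof."""
    return peak_gflops / bandwidth_gbs


# Hypothetical platform figures, for illustration only.
PEAK_GFLOPS = 1000.0   # assumed peak compute throughput
BANDWIDTH_GBS = 20.0   # assumed external memory bandwidth

if __name__ == "__main__":
    ridge = ridge_point(PEAK_GFLOPS, BANDWIDTH_GBS)
    for intensity in (0.5, 5.0, 50.0, 500.0):
        bound = attainable_gflops(PEAK_GFLOPS, BANDWIDTH_GBS, intensity)
        regime = "memory-bound" if intensity < ridge else "compute-bound"
        print(f"intensity={intensity:7.1f} FLOP/B  bound={bound:7.1f} GFLOP/s  {regime}")
```

Under these assumptions, raising the computational intensity of a kernel (for example by reusing data held in local memory) moves it to the right along the memory roof, which is why the bound increases until the ridge point is reached.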
Main file
2021105741.pdf (706.77 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-03226257, version 1 (20-05-2021)
hal-03226257, version 2 (14-10-2021)

Cite

Daouda Diakite, Nicolas Gac, Maxime Martelli. OpenCL FPGA Optimization guided by memory accesses and roofline model analysis applied to tomography acceleration. 31st International Conference on Field Programmable Logic and Applications (FPL), Aug 2021, Dresden (virtual), Germany. pp.109-114, ⟨10.1109/FPL53798.2021.00026⟩. ⟨hal-03226257v2⟩