Implementations Impact on Iterative Image Processing for Embedded GPU - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2021

Implementations Impact on Iterative Image Processing for Embedded GPU

Résumé

The emergence of low-power embedded Graphical Processing Units (GPUs) with high computation capabilities has enabled the integration of image processing chains in a wide variety of embedded systems. Various optimisation techniques are however needed in order to get the most out of an embedded GPU. This paper explores several optimisation methods for iterative stencil-like image processing algorithms on embedded NVIDIA GPUs using the Compute Unified Device Architecture (CUDA) API. We chose to focus our architectural optimisations on the TV-L1 algorithm, an optical flow estimation method based on total variation (TV) regularisation and the L1 norm. It is widely used as a model for more complex optical flow estimations and is used in many recent video processing applications. In this work we evaluate the impact of architecture-oriented optimisations on both execution time and energy consumption on several Nvidia Jetson GPU embedded boards. Results show a speedup up to 3× compared to State-of-the-Art versions as well as a 2.6× decrease in energy consumption.
Fichier principal
Vignette du fichier
EUSIPCO21_OF.pdf (1.51 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03330779 , version 1 (01-09-2021)

Identifiants

  • HAL Id : hal-03330779 , version 1

Citer

Thomas Romera, Andrea Petreto, Florian Lemaitre, Manuel Bouyer, Quentin Meunier, et al.. Implementations Impact on Iterative Image Processing for Embedded GPU. European Signal Processing Conference (EUSIPCO), Aug 2021, Dublin, Ireland. ⟨hal-03330779⟩
86 Consultations
135 Téléchargements

Partager

Gmail Facebook X LinkedIn More