A unified runtime system for heterogeneous multicore architectures - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2008

A unified runtime system for heterogeneous multicore architectures

Résumé

Approaching the theoretical performance of heterogeneous multicore architectures, equipped with specialized accelerators, is a challenging issue. Unlike regular CPUs that can transparently access the whole global memory address range, accelerators usually embed local memory on which they perform all their computations using a specific instruction set. While many research efforts have been devoted to offloading parts of a program over such coprocessors, the real challenge is to find a programming model providing a unified view of all available computing units. In this paper, we present an original runtime system providing a high-level, unified execution model allowing seamless execution of tasks over the underlying heterogeneous hardware. The runtime is based on a hierarchical memory management facility and on a codelet scheduler. We demonstrate the efficiency of our solution with a LU decomposition for both homogeneous (3.8 speedup on 4 cores) and heterogeneous machines (95% efficiency). We also show that a "granularity aware" scheduling can improve execution time by 35%.
Fichier principal
Vignette du fichier
AugNam08HPPC.pdf (177.9 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

inria-00326917 , version 1 (06-10-2008)

Identifiants

  • HAL Id : inria-00326917 , version 1

Citer

Cédric Augonnet, Raymond Namyst. A unified runtime system for heterogeneous multicore architectures. 2nd Workshop on Highly Parallel Processing on a Chip (HPPC 2008), Aug 2008, Las Palmas de Gran Canaria, Spain. ⟨inria-00326917⟩
382 Consultations
620 Téléchargements

Partager

Gmail Facebook X LinkedIn More