Skip to Main content Skip to Navigation
New interface
Reports (Research report)

Identifying scalar behavior in CUDA kernels

Caroline Collange 1, * 
* Corresponding author
1 ARENAIRE - Computer arithmetic
Inria Grenoble - Rhône-Alpes, LIP - Laboratoire de l'Informatique du Parallélisme
Abstract : We propose a compiler analysis pass for programs expressed in the Single Program, Multiple Data (SPMD) programming model. It identifies statically several kinds of regular patterns that can occur between adjacent threads, including common computations, memory accesses at consecutive locations or at the same location and uniform control flow. This knowledge can be exploited by SPMD compilers targeting SIMD architectures. We present a compiler pass developed within the Ocelot framework that performs this analysis on NVIDIA CUDA programs at the PTX intermediate language level. Results are compared with optima obtained by simulation of several sets of CUDA benchmarks.
Complete list of metadata

Cited literature [17 references]  Display  Hide  Download
Contributor : Caroline Collange Connect in order to contact the contributor
Submitted on : Wednesday, January 12, 2011 - 2:20:33 PM
Last modification on : Wednesday, October 26, 2022 - 8:14:28 AM
Long-term archiving on: : Tuesday, November 6, 2012 - 11:20:11 AM


Files produced by the author(s)


  • HAL Id : hal-00555134, version 1


Caroline Collange. Identifying scalar behavior in CUDA kernels. [Research Report] ENS Lyon. 2011. ⟨hal-00555134⟩



Record views


Files downloads