Skip to Main content Skip to Navigation
Journal articles

Parallelization Schemes for Memory Optimization on the Cell Processor: A Case Study on the Harris Corner Detector

Abstract : The Cell processor is a typical example of a heterogeneous multiprocessor on-chip architecture that uses several levels of parallelism to deliver high performance. Reducing the gap between peak performance and effective performance is the challenge for software tool developers and the application developers. Image processing and media applications are typical "main stream" applications. We use the Harris algorithm for the detection of interest points in an image as a benchmark to compare the performance of several parallel schemes on a Cell processor. The impact of the DMA controlled data transfers and the synchronizations between SPEs explains the differences between the performance of the different parallelization schemes. The scalability of the architecture is modeled and evaluated.
Complete list of metadata

https://hal-mines-paristech.archives-ouvertes.fr/hal-00753708
Contributor : Claire Medrala Connect in order to contact the contributor
Submitted on : Monday, November 19, 2012 - 3:22:23 PM
Last modification on : Wednesday, September 16, 2020 - 4:56:01 PM

Links full text

Identifiers

Collections

Citation

Tarik Saidani, Lionel Lacassagne, Joel Falcou, Claude Tadonki. Parallelization Schemes for Memory Optimization on the Cell Processor: A Case Study on the Harris Corner Detector. Transactions on High-Performance Embedded Architectures and Compilers III, 2011, Vol. 3, pp 177-200. ⟨10.1007/978-3-642-19448-1_10⟩. ⟨hal-00753708⟩

Share

Metrics

Record views

190