Skip to Main content Skip to Navigation
Journal articles

Performance Optimization and Modeling of Blocked Sparse Kernels

Abstract : We present a method for automatically selecting optimal implementations of sparse matrix-vector operations. Our software “AcCELS” (Accelerated Compress-storage Elements for Linear Solvers) involves a setup phase that probes machine characteristics, and a run-time phase where stored characteristics are combined with a measure of the actual sparse matrix to find the optimal kernel implementation. We present a performance model that is shown to be accurate over a large range of matrices.
Complete list of metadatas
Contributor : Alfredo Buttari <>
Submitted on : Friday, December 20, 2019 - 11:54:14 AM
Last modification on : Tuesday, September 8, 2020 - 9:00:04 AM

Links full text



Alfredo Buttari, Victor Eijkhout, Julien Langou, Salvatore Filippone. Performance Optimization and Modeling of Blocked Sparse Kernels. International Journal of High Performance Computing Applications, SAGE Publications, 2016, 21 (4), pp.467-484. ⟨10.1177/1094342007083801⟩. ⟨hal-02421133⟩



Record views