Small SIMD Matrices for CERN High Throughput Computing - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

Small SIMD Matrices for CERN High Throughput Computing

Résumé

System tracking is an old problem and has been heavily optimized throughout the past. However, in High Energy Physics, many small systems are tracked in real-time using Kalman filtering and no implementation satisfying those constraints currently exists. In this paper, we present a code generator used to speed up Cholesky Factorization and Kalman Filter for small matrices. The generator is easy to use and produces portable and heavily optimized code. We focus on current SIMD architectures (SSE, AVX, AVX512, Neon, SVE, Altivec and VSX). Our Cholesky factorization outperforms any existing libraries: from x3 to x10 faster than MKL. The Kalman Filter is also faster than existing implementations, and achieves 4e9 iter/s on a 2x24C Intel Xeon.
Fichier principal
Vignette du fichier
final-draft.pdf (530.48 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01760260 , version 1 (06-04-2018)

Identifiants

Citer

Florian Lemaitre, Benjamin Couturier, Lionel Lacassagne. Small SIMD Matrices for CERN High Throughput Computing. WPMVP 2018 Workshop on Programming Models for SIMD/Vector Processing, Feb 2018, Vienna, Austria. ⟨10.1145/3178433.3178434⟩. ⟨hal-01760260⟩
164 Consultations
929 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More