A. , R. And-kennedy, K. ]. , and D. H. , Optimizing compilers for modern architectures: a dependence-based approach LHCb Kalman Filter cross architecture studies, Journal of Physics Conference Series, vol.89, issue.3, p.32052, 2002.

C. , A. Le-gal, B. Leroux, C. Aumage, O. And-barthou et al., An efficient, portable and generic library for successive cancellation decoding of polar codes, International Workshop on Languages and Compilers for Parallel Computing, pp.303-317, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01203105

C. , G. Elmer, P. Krutelyov, S. Lantz, S. Lefebvre et al., Kalman filter tracking on parallel architectures, Journal of Physics Conference Series, vol.898, issue.4, p.42051, 2017.

D. , T. Haidar, A. Luszczek, P. Harris, J. A. Tomov et al., LU factorization of small matrices: accelerating batched DGETRF on the GPU, High Performance Computing and Communications, 2014 IEEE 6th Intl Symp on Cyberspace Safety and Security, 2014 IEEE 11th Intl Conf on Embedded Software and Syst 2014 IEEE Intl Conf on, pp.157-160, 2014.

E. , P. Falcou, J. Gaunard, M. And-laprestélaprest´lapresté, and J. Boost, SIMD: Generic programming for portable SIMDization, Proceedings of the 2014 Workshop on Programming Models for SIMD/Vector Processing, pp.14-15
URL : https://hal.archives-ouvertes.fr/hal-01759064

F. Fr¨uhwirth and R. , Application of Kalman filtering to track and vertex fitting. Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, pp.444-450, 1987.

G. , S. Kebschull, U. Kisel, I. Lindenstruth, V. And-m-¨-uller et al., Fast SIMDized Kalman filter based track fit, Computer Physics Communications, vol.178, issue.5, pp.374-383, 2008.

K. Karpi´nski, P. And-mcdonald, and J. , A high-performance portable abstract interface for explicit SIMD vectorization, Proceedings of the 8th International Workshop on Programming Models and Applications for Multicores and Manycores PMAM'17, ACM, pp.21-28

K. , M. And-lindenstruth, and V. , Vc: A C++ library for explicit vectorization. Software: Practice and Experience, pp.1409-1430, 2012.

L. , L. Etiemble, D. Hassan-zahraee, A. Dominguez, A. And-vezolle et al., High level transforms for SIMD and low-level computer vision algorithms, ACM Workshop on Programming Models for SIMD/Vector Processing (PPoPP) (2014), pp.49-56
URL : https://hal.archives-ouvertes.fr/hal-01094906

L. , F. Couturier, B. And-lacassagne, and L. , Cholesky factorization on simd multi-core architectures, Journal of Systems Architecture C, vol.79, pp.1-15, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01550129

M. , I. Baboulin, M. And-falcou, and J. , Metaprogramming dense linear algebra solvers applications to multi and many-core architectures, Trustcom/BigDataSE/ISPA, pp.69-76, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01221358

P. , M. A. And-krecker, and D. K. , Parallel Kalman filtering on the connection machine, Frontiers of Massively Parallel Computation Proceedings., 3rd Symposium on the, pp.55-58, 1990.

P. , M. And-mark, and W. , ispc: A SPMD compiler for highperformance CPU programming, Innovative Parallel Computing (InPar), pp.2012-2013, 2012.

S. , P. And-leeser, and M. , Area and performance tradeoffs in floating-point divide and square-root implementations, ACM Comput. Surv, vol.28, pp.3-518, 1996.