Y. Barsamian, S. A. Hirstoaga, and E. Violard, Efficient Data Structures for a Hybrid Parallel and Vectorized Particle-in-Cell Code, 2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), p.1168, 2017.
DOI : 10.1109/IPDPSW.2017.74

URL : https://hal.archives-ouvertes.fr/hal-01504645

R. W. Hockney and J. W. Eastwood, Computer Simulation Using Particles, Institute of Physics, 1988.
DOI : 10.1201/9781439822050

K. J. Bowers, B. J. Albright, L. Yin, B. Bergen, and T. J. Kwan, Ultrahigh performance three-dimensional electromagnetic relativistic kinetic plasma simulation, Physics of Plasmas, vol.15, issue.5, 2008.
DOI : 10.1063/1.2768933

V. K. Decyk, S. R. Karmesin, A. de Boer, and P. C. Liewer, Optimization of particle-in-cell codes on reduced instruction set computer processors, Computers in Physics, vol.10, issue.3, pp.290-298, 1996.
DOI : 10.1063/1.168571

H. Vincenti, M. Lobet, R. Lehe, R. Sasanka, and J. Vay, An efficient and portable SIMD algorithm for charge/current deposition in Particle-In-Cell codes, Computer Physics Communications, vol.210, pp.145-154, 2016.
DOI : 10.1016/j.cpc.2016.08.023

URL : https://hal.archives-ouvertes.fr/cea-01426502

K. Germaschewski, W. Fox, S. Abbott, N. Ahmadi, K. Maynard et al., The Plasma Simulation Code: A modern particle-in-cell code with patch-based load-balancing, Journal of Computational Physics, vol.318, pp.305-326, 2016.
DOI : 10.1016/j.jcp.2016.05.013

URL : https://manuscript.elsevier.com/S0021999116301413/pdf/S0021999116301413.pdf

V. K. Decyk and T. V. Singh, Particle-in-Cell algorithms for emerging computer architectures, Computer Physics Communications, vol.185, issue.3, pp.708-719, 2014.
DOI : 10.1016/j.cpc.2013.10.013

URL : https://doi.org/10.1016/j.cpc.2013.10.013

A. Jocksch, F. Hariri, T. Tran, S. Brunner, C. Gheller et al., A Bucket Sort Algorithm for the Particle-In-Cell Method on Manycore Architectures, Parallel Processing and Applied Mathematics: 11th Intl. Conf. (PPAM), pp.43-52, 2016.
DOI : 10.1007/978-3-319-32149-3_5

E. Chacon-Golcher, S. A. Hirstoaga, and M. Lutz, Optimization of particle-in-cell simulations for Vlasov-Poisson system with strong magnetic field, ESAIM: Proceedings and Surveys, vol.53, pp.177-190, 2016.
DOI : 10.1051/proc/201653011

URL : https://www.esaim-proc.org/articles/proc/pdf/2016/01/proc165311.pdf

F. Panneton, P. L'Ecuyer, and M. Matsumoto, Improved long-period generators based on linear recurrences modulo 2, ACM Transactions on Mathematical Software, vol.32, issue.1, pp.1-16, 2006.
DOI : 10.1145/1132973.1132974

URL : http://www.iro.umontreal.ca/~lecuyer/myftp/papers/wellrng.pdf

C. K. Birdsall and D. Fuss, Clouds-in-clouds, clouds-in-cells physics for many-body plasma simulation, Journal of Computational Physics, vol.3, issue.4, pp.494-511, 1969.
DOI : 10.1016/0021-9991(69)90058-8

K. J. Bowers, Speed optimal implementation of a fully relativistic particle push with charge conserving current accumulation on modern processors, Proceedings of the 18th Int. Conf. Numerical Simulation of Plasmas, pp.383-386, 2003.

T. H. Cormen, C. E. Leiserson, R. L. Rivest, and C. Stein, Introduction to Algorithms, 2009.

S. Chatterjee, V. V. Jain, A. R. Lebeck, S. Mundhra, and M. Thottethodi, Nonlinear array layouts for hierarchical memory systems, Proceedings of the 13th International Conference on Supercomputing, ICS '99, pp.444-453, 1999.
DOI : 10.1145/305138.305231

URL : http://www.cs.duke.edu/%7Ealvy/papers/ics99.pdf

J. Mellor-Crummey, D. Whalley, and K. Kennedy, Improving memory hierarchy performance for irregular applications using data and computation reorderings, International Journal of Parallel Programming, vol.29, issue.3, pp.217-247, 2001.
DOI : 10.1023/A:1011119519789

URL : http://cacs.usc.edu/education/cs596/Mellor-opt-ISC99.pdf

D. DeFord and A. Kalyanaraman, Empirical Analysis of Space-Filling Curves for Scientific Computing Applications, 2013 42nd International Conference on Parallel Processing, pp.170-179, 2013.
DOI : 10.1109/ICPP.2013.26

M. Bussmann, H. Burau, T. E. Cowan, A. Debus, A. Huebl et al., Radiative signatures of the relativistic Kelvin-Helmholtz instability, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC '13, pp.1-5, 2013.
DOI : 10.1145/2503210.2504564

URL : http://dl.acm.org/ft_gateway.cfm?id=2504564&type=pdf

I. Surmin, S. Bastrakov, Z. Matveev, E. Efimenko, A. Gonoskov et al., Co-design of a Particle-in-Cell Plasma Simulation Code for Intel Xeon Phi: A First Look at Knights Landing, Proceedings of the 16th Intl. Conf. on Algorithms and Architectures for Parallel Processing Collocated Workshops (ICA3PP, SCDT), pp.319-329, 2016.

M. Frigo and S. G. Johnson, The Design and Implementation of FFTW3, Proceedings of the IEEE, vol.93, issue.2, pp.216-231, 2005.
DOI : 10.1109/JPROC.2004.840301

URL : http://math.mit.edu/~stevenj/papers/FrigoJo05.pdf

M. J. Wolfe, High Performance Compilers for Parallel Computing, 1995.

G. M. Morton, A computer oriented geodetic data base and a new technique in file sequencing, Tech. rep., IBM Ltd, 1966.

R. Raman and D. S. Wise, Converting to and from Dilated Integers, IEEE Transactions on Computers, vol.57, issue.4, 2008.
DOI : 10.1109/TC.2007.70814

C. Severance and K. Dowd, High Performance Computing, OpenStax CNX, 2010.

J. D. McCalpin, Memory bandwidth and machine balance in current high performance computers, IEEE Computer Society Technical Committee on Computer Architecture Newsletter (TCCA), pp.19-25, 1995.