J. Dongarra, S. Moore, G. Peterson, S. Tomov, J. Allred et al., Exploring New Architectures in Accelerating CFD for Air Force Applications, 2008 DoD HPCMP Users Group Conference, pp.14-17, 2008.
DOI : 10.1109/DoD.HPCMP.UGC.2008.12

S. Tomov, J. Dongarra, and M. Baboulin, Towards dense linear algebra for hybrid GPU accelerated manycore systems, Parallel Computing, vol.36, issue.5-6
DOI : 10.1016/j.parco.2009.12.005

. Nvidia, Compute Unified Device Architecture Programming Guide version 2, 2009.

J. Tölke, Implementation of a Lattice Boltzmann kernel using the Compute Unified Device Architecture developed by nVIDIA, Computing and Visualization in Science, vol.17, issue.4, pp.1-11
DOI : 10.1007/s00791-008-0120-2

G. R. Mcnamara and G. Zanetti, Use of the Boltzmann Equation to Simulate Lattice-Gas Automata, Physical Review Letters, vol.61, issue.20, pp.61-2332, 1988.
DOI : 10.1103/PhysRevLett.61.2332

Y. H. Qian, D. Humières, and P. Lallemand, Lattice BGK Models for Navier-Stokes Equation, Europhysics Letters (EPL), vol.17, issue.6, pp.479-484, 1992.
DOI : 10.1209/0295-5075/17/6/001

D. Humières, I. Ginzburg, M. Krafczyk, P. Lallemand, and L. Luo, Multiple-relaxation-time lattice Boltzmann models in three dimensions, Philosophical Transactions: Mathematical, Physical and Engineering Sciences, pp.437-451, 2002.

T. Pohl, M. Kowarschik, J. Wilke, K. Iglberger, and U. Rüde, OPTIMIZATION AND PROFILING OF THE CACHE PERFORMANCE OF PARALLEL LATTICE BOLTZMANN CODES, Parallel Processing Letters, vol.13, issue.04, pp.549-560, 2003.
DOI : 10.1142/S0129626403001501

S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk et al., Optimization principles and application performance evaluation of a multithreaded GPU using CUDA, Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming , PPoPP '08, pp.73-82, 2008.
DOI : 10.1145/1345206.1345220

J. L. Henning and . Descriptions, SPEC CPU2006 benchmark descriptions, ACM SIGARCH Computer Architecture News, vol.34, issue.4, pp.1-17, 2006.
DOI : 10.1145/1186736.1186737

J. Habich, Performance Evaluation of Numeric Compute Kernels on nVIDIA GPUs

J. Tölke and M. Krafczyk, TeraFLOP computing on a desktop PC with GPUs for 3D CFD, International Journal of Computational Fluid Dynamics, vol.77, issue.7, pp.443-456, 2008.
DOI : 10.1002/cav.143

K. Martin and B. Hoffman, Mastering CMake, A Cross- Platform Build System, 2008.

W. J. Schroeder, K. Martin, L. S. Avila, and C. C. Law, The VTK User's Guide, 2006.

Q. Zou and X. , He, On pressure and velocity flow boundary conditions and bounceback for the lattice Boltzmann BGK model, Arxiv preprint comp-gas, 9611001.

S. Albensoeder and H. C. Kuhlmann, Accurate three-dimensional lid-driven cavity flow, Journal of Computational Physics, vol.206, issue.2, pp.536-558, 2005.
DOI : 10.1016/j.jcp.2004.12.024

M. Murphy, NVIDIA's Experience with Open64, nVidia