Scalable Lattice Boltzmann Solvers for CUDA GPU Clusters

Christian Obrecht 1, * Frédéric Kuznik 1 Bernard Tourancheau 2 Jean-Jacques Roux 1
* Corresponding author
2 Drakkar
LIG - Laboratoire d'Informatique de Grenoble
Abstract : The lattice Boltzmann method (LBM) is an innovative and promising approach in computational fluid dynamics. From an algorithmic standpoint it reduces to a regular data parallel procedure and is therefore well-suited to high performance computations. Numerous works report efficient implementations of the LBM for the GPU, but very few mention multi-GPU versions and even fewer GPU cluster implementations. Yet, to be of practical interest, GPU LBM solvers need to be able to perform large scale simulations. In the present contribution, we describe an efficient LBM implementation for CUDA GPU clusters. Our solver consists of a set of MPI communication routines and a CUDA kernel specifically designed to handle three-dimensional partitioning of the computation domain. Performance measurement were carried out on a small cluster. We show that the results are satisfying, both in terms of data throughput and parallelisation efficiency.
Document type :
Journal articles
Liste complète des métadonnées

Cited literature [15 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-00931058
Contributor : Valérie Samper <>
Submitted on : Wednesday, June 11, 2014 - 10:13:18 AM
Last modification on : Monday, December 10, 2018 - 10:54:01 AM
Document(s) archivé(s) le : Thursday, September 11, 2014 - 10:40:41 AM

File

ACL35.pdf
Files produced by the author(s)

Identifiers

Citation

Christian Obrecht, Frédéric Kuznik, Bernard Tourancheau, Jean-Jacques Roux. Scalable Lattice Boltzmann Solvers for CUDA GPU Clusters. Parallel Computing, Elsevier, 2013, 39 (6-7), pp.259-270. ⟨10.1016/j.parco.2013.04.001⟩. ⟨hal-00931058⟩

Share

Metrics

Record views

408

Files downloads

450