Performance analysis and optimization strategies for a D3Q19 lattice Boltzmann kernel on nVIDIA GPUs using CUDA

Conference contribution


Publication Details

Author(s): Habich J, Zeiser T, Hager G, Wellein G
Edition name or number: 5
Title edited volumes: Advances in Engineering Software
Publisher: Elsevier
Publishing place: ScienceDirect
Publication year: 2011
Volume: 42
Conference Proceedings Title: Advances in Engineering Software
Pages range: 266-272
ISSN: 0965-9978
Language: English


Abstract


This paper presents implementation strategies and optimization approaches for a D3Q19 lattice Boltzmann flow solver on nVIDIA graphics processing units (GPUs). Using the STREAM benchmarks we demonstrate the GPU parallelization approach and obtain an upper limit for the flow solver performance. We discuss the GPU-specific implementation of the solver with a focus on memory alignment and register shortage. The optimized code is up to an order of magnitude faster than standard two-socket x86 servers with AMD Barcelona or Intel Nehalem CPUs. We further analyze data transfer rates for the PCI-express bus to evaluate the potential benefits of multi-GPU parallelism in a cluster environment. © 2010 Civil-Comp Ltd and Elsevier Ltd. All rights reserved.



FAU Authors / FAU Editors

Habich, Johannes
Regionales Rechenzentrum Erlangen (RRZE)
Hager, Georg Dr.
Regionales Rechenzentrum Erlangen (RRZE)
Wellein, Gerhard Prof. Dr.
Professur für Höchstleistungsrechnen
Zeiser, Thomas Dr.
Regionales Rechenzentrum Erlangen (RRZE)


How to cite

APA:
Habich, J., Zeiser, T., Hager, G., & Wellein, G. (2011). Performance analysis and optimization strategies for a D3Q19 lattice Boltzmann kernel on nVIDIA GPUs using CUDA. In Advances in Engineering Software (pp. 266-272). ScienceDirect: Elsevier.

MLA:
Habich, Johannes, et al. "Performance analysis and optimization strategies for a D3Q19 lattice Boltzmann kernel on nVIDIA GPUs using CUDA." Proceedings of the PARENG 2009 ScienceDirect: Elsevier, 2011. 266-272.

BibTeX: 

Last updated on 2018-09-08 at 22:55