Speeding up a Lattice Boltzmann Kernel on nVIDIA GPUs

Conference contribution


Publication Details

Author(s): Habich J, Zeiser T, Hager G, Wellein G
Title edited volumes: Civil-Comp Proceedings
Publisher: Civil-Comp Press
Publishing place: Kippen, Stirlingshire, United Kingdom
Publication year: 2009
Conference Proceedings Title: Proceedings of the First International Conference on Parallel, Distributed and Grid Computing for Engineering
Pages range: 17
ISBN: 978-1-905088-29-4


Abstract


This paper presents implementation strategies and optimization approaches for a lattice Boltzmann flow solver on nVIDIA graphics processing units (GPUs). We outline the GPU-specific implementation of the flow solver, describe potential problems that occur, and discuss how to avoid or at least ameliorate them. It is shown that a single graphics card can achieve ten times the performance of a two-socket quad-core server. Furthermore, we employ STREAM benchmarks to evaluate achievable performance and to establish programming guidelines for applications. With regard to hybrid computing, the attainable bandwidth of the different interfaces involved is evaluated. © Civil-Comp Press, 2009.



FAU Authors / FAU Editors

Habich, Johannes
Regionales Rechenzentrum Erlangen (RRZE)
Hager, Georg Dr.
Regionales Rechenzentrum Erlangen (RRZE)
Wellein, Gerhard Prof. Dr.
Professur für Höchstleistungsrechnen
Zeiser, Thomas Dr.
Regionales Rechenzentrum Erlangen (RRZE)


How to cite

APA:
Habich, J., Zeiser, T., Hager, G., & Wellein, G. (2009). Speeding up a Lattice Boltzmann Kernel on nVIDIA GPUs. In Proceedings of the First International Conference on Parallel, Distributed and Grid Computing for Engineering (pp. 17). Pécs, Hungary. Kippen, Stirlingshire, United Kingdom: Civil-Comp Press.

MLA:
Habich, Johannes, et al. "Speeding up a Lattice Boltzmann Kernel on nVIDIA GPUs." Proceedings of the PARENG2009, Pécs, Hungary. Kippen, Stirlingshire, United Kingdom: Civil-Comp Press, 2009. 17.

Last updated on 2018-09-08 at 22:40