Habich J, Zeiser T, Hager G, Wellein G (2009)
Publication Type: Conference contribution
Publication year: 2009
Publisher: Civil-Comp Press
Edited Volumes: Civil-Comp Proceedings
City/Town: Kippen, Stirlingshire, United Kingdom
Pages Range: 17
Conference Proceedings Title: Proceedings of the First International Conference on Parallel, Distributed and Grid Computing for Engineering
ISBN: 978-1-905088-29-4
This paper presents implementation strategies and optimization approaches for a lattice Boltzmann flow solver on nVIDIA graphics processing units (GPUs). We outline the GPU-specific implementation of the flow solver and the potential problems that occur, and discuss how to avoid or at least ameliorate them. It is shown that a single graphics card can achieve ten times the performance of a two-socket server based on quad-core processors. Furthermore, we employ STREAM benchmarks to evaluate achievable performance and to establish programming guidelines for applications. With regard to hybrid computing, the attainable bandwidth of the different interfaces involved is evaluated. © Civil-Comp Press, 2009.
APA:
Habich, J., Zeiser, T., Hager, G., & Wellein, G. (2009). Speeding up a Lattice Boltzmann Kernel on nVIDIA GPUs. In Proceedings of the First International Conference on Parallel, Distributed and Grid Computing for Engineering (pp. 17). Pécs, Hungary. Kippen, Stirlingshire, United Kingdom: Civil-Comp Press.
MLA:
Habich, Johannes, et al. "Speeding up a Lattice Boltzmann Kernel on nVIDIA GPUs." Proceedings of the PARENG2009, Pécs, Hungary. Kippen, Stirlingshire, United Kingdom: Civil-Comp Press, 2009. 17.