Performance comparison of different parallel lattice Boltzmann implementations on multi-core multi-socket systems

Journal article


Publication Details

Author(s): Donath S, Iglberger K, Wellein G, Zeiser T, Nitsure A, Rüde U
Journal: International Journal of Computational Science and Engineering
Publisher: Inderscience Enterprises
Publication year: 2008
Volume: 4
Journal issue: 1
Pages range: 3-11
ISSN: 1742-7185
eISSN: 1742-7193
Language: English


Abstract


In this report, we discuss the performance behaviour of different parallel lattice Boltzmann implementations. In previous works, we already proposed a fast serial implementation and a cache oblivious spatial and temporal blocking algorithm for the lattice Boltzmann method (LBM) in three spatial dimensions. The cache oblivious update scheme has originally been proposed by Frigo et al. The main idea is to provide maximum performance results for stencil-based methods by dividing the space-time domain in an optimal way, independently of any external parameters, such as cache size. In view of the increasing gap between processor speed and memory performance, this approach offers a promising path to increase cache utilisation. We present results for the shared memory parallelisation of the cache oblivious implementation based on task queueing in comparison to the iterative standard implementation, thereby focusing on the special issues for multi-core and multi-socket systems. © 2008, Inderscience Publishers.



FAU Authors / FAU Editors

Donath, Stefan Dr.-Ing.
Lehrstuhl für Informatik 10 (Systemsimulation)
Iglberger, Klaus Dr.-Ing.
Zentralinstitut für Scientific Computing (ZISC)
Rüde, Ulrich Prof. Dr.
Lehrstuhl für Informatik 10 (Systemsimulation)
Wellein, Gerhard Prof. Dr.
Professur für Höchstleistungsrechnen
Zeiser, Thomas Dr.
Regionales Rechenzentrum Erlangen (RRZE)


How to cite

APA:
Donath, S., Iglberger, K., Wellein, G., Zeiser, T., Nitsure, A., & Rüde, U. (2008). Performance comparison of different parallel lattice Boltzmann implementations on multi-core multi-socket systems. International Journal of Computational Science and Engineering, 4(1), 3-11. https://dx.doi.org/10.1504/IJCSE.2008.021107

MLA:
Donath, Stefan, et al. "Performance comparison of different parallel lattice Boltzmann implementations on multi-core multi-socket systems." International Journal of Computational Science and Engineering 4.1 (2008): 3-11.

BibTeX: 

Last updated on 2018-09-08 at 14:38