Introducing a parallel cache oblivious blocking approach for the lattice Boltzmann method

Zeiser T, Wellein G, Iglberger K, Rüde U, Hager G, Nitsure A (2008)


Publication Type: Journal article

Publication year: 2008

Journal

Publisher: Inderscience

Book Volume: 8

Pages Range: 179-188

Journal Issue: 1-4

DOI: 10.1504/PCFD.2008.018088

Abstract

In this report we propose a parallel cache oblivious spatial and temporal blocking algorithm for the lattice Boltzmann method in three spatial dimensions. The algorithm has originally been proposed by Frigo et al. (1999) and divides the space-time domain of stencil-based methods in an optimal way, independently of any external parameters, e.g., cache size. In view of the increasing gap between processor speed and memory performance this approach offers a promising path to increase cache utilisation. We find that even a straightforward cache oblivious implementation can reduce memory traffic at least by a factor of two if compared to a highly optimised standard kernel and improves scalability for shared memory parallelisation. Due to the recursive structure of the algorithm we use an unconventional parallelisation scheme based on task queuing. Copyright © 2008, Inderscience Publishers.

Authors with CRIS profile

How to cite

APA:

Zeiser, T., Wellein, G., Iglberger, K., Rüde, U., Hager, G., & Nitsure, A. (2008). Introducing a parallel cache oblivious blocking approach for the lattice Boltzmann method. Progress in Computational Fluid Dynamics, 8(1-4), 179-188. https://dx.doi.org/10.1504/PCFD.2008.018088

MLA:

Zeiser, Thomas, et al. "Introducing a parallel cache oblivious blocking approach for the lattice Boltzmann method." Progress in Computational Fluid Dynamics 8.1-4 (2008): 179-188.

BibTeX: Download