Optimization and Profiling of the Cache Performance of Parallel Lattice Boltzmann Codes

Beitrag in einer Fachzeitschrift


Details zur Publikation

Autor(en): Pohl T, Kowarschik M, Wilke J, Iglberger K, Rüde U
Zeitschrift: Parallel Processing Letters
Verlag: World Scientific Publishing Co
Jahr der Veröffentlichung: 2003
Band: 13
Heftnummer: 4
Seitenbereich: 549-560
ISSN: 0129-6264


Abstract

When designing and implementing highly efficient scientific applications for parallel computers such as clusters of workstations, it is inevitable to consider and to optimize the single-CPU performance of the codes. For this purpose, it is particularly important that the codes respect the hierarchical memory designs that computer architects employ in order to hide the effects of the growing gap between CPU performance and main memory speed. In this article, we present techniques to enhance the single-CPU efficiency of lattice Boltzmann methods which are commonly used in computational fluid dynamics. We show various performance results for both 2D and 3D codes in order to emphasize the effectiveness of our optimization techniques.


FAU-Autoren / FAU-Herausgeber

Iglberger, Klaus Dr.-Ing.
Zentralinstitut für Scientific Computing (ZISC)
Rüde, Ulrich Prof. Dr.
Lehrstuhl für Informatik 10 (Systemsimulation)


Zitierweisen

APA:
Pohl, T., Kowarschik, M., Wilke, J., Iglberger, K., & Rüde, U. (2003). Optimization and Profiling of the Cache Performance of Parallel Lattice Boltzmann Codes. Parallel Processing Letters, 13(4), 549-560. https://dx.doi.org/10.1142/S0129626403001501

MLA:
Pohl, Thomas, et al. "Optimization and Profiling of the Cache Performance of Parallel Lattice Boltzmann Codes." Parallel Processing Letters 13.4 (2003): 549-560.

BibTeX: 

Zuletzt aktualisiert 2018-21-08 um 13:38