A flexible Patch-based lattice Boltzmann parallelization approach for heterogeneous GPU-CPU clusters

Journal article
(Original article)


Publication Details

Author(s): Feichtinger C, Köstler H, Hager G, Rüde U, Wellein G, Habich J
Journal: Parallel Computing
Publisher: Elsevier
Publication year: 2011
Volume: 37
Journal issue: 9
Page range: 536-549
ISSN: 0167-8191
Language: English


Abstract


Sustaining a large fraction of single GPU performance in parallel computations is considered to be the major problem of GPU-based clusters. We address this issue in the context of a lattice Boltzmann flow solver that is integrated in the WaLBerla software framework. Our multi-GPU implementation uses a block-structured MPI parallelization and is suitable for load balancing and heterogeneous computations on CPUs and GPUs. The overhead required for multi-GPU simulations is discussed in detail. It is demonstrated that a large fraction of the kernel performance can be sustained for weak scaling on InfiniBand clusters, leading to excellent parallel efficiency. However, in strong scaling scenarios, using multiple GPUs is much less efficient than running CPU-only simulations on IBM BG/P and x86-based clusters. Hence, a cost analysis must determine the best course of action for a particular simulation task and hardware configuration. Finally, we present weak scaling results of heterogeneous simulations conducted on CPUs and GPUs simultaneously, using clusters equipped with varying node configurations. © 2011 Elsevier B.V. All rights reserved.



FAU Authors / FAU Editors

Feichtinger, Christian Dr.-Ing.
Lehrstuhl für Informatik 10 (Systemsimulation)
Hager, Georg Dr.
Regionales Rechenzentrum Erlangen (RRZE)
Köstler, Harald Prof. Dr.
Lehrstuhl für Informatik 10 (Systemsimulation)
Rüde, Ulrich Prof. Dr.
Lehrstuhl für Informatik 10 (Systemsimulation)
Wellein, Gerhard Prof. Dr.
Professur für Höchstleistungsrechnen


Additional Organisation
Exzellenz-Cluster Engineering of Advanced Materials


Research Fields

A3 Multiscale Modeling and Simulation
Exzellenz-Cluster Engineering of Advanced Materials


How to cite

APA:
Feichtinger, C., Köstler, H., Hager, G., Rüde, U., Wellein, G., & Habich, J. (2011). A flexible Patch-based lattice Boltzmann parallelization approach for heterogeneous GPU-CPU clusters. Parallel Computing, 37(9), 536-549. https://dx.doi.org/10.1016/j.parco.2011.03.005

MLA:
Feichtinger, Christian, et al. "A flexible Patch-based lattice Boltzmann parallelization approach for heterogeneous GPU-CPU clusters." Parallel Computing 37.9 (2011): 536-549.

Last updated on 2018-09-08 at 16:08
