Godenschwager C, Schornbaum F, Bauer M, Köstler H, Rüde U (2013)
Publication Language: English
Publication Type: Conference contribution
Publication year: 2013
Publisher: IEEE Computer Society
Edited Volumes: International Conference for High Performance Computing, Networking, Storage and Analysis, SC
Page Range: 35-1 -- 35-12
Conference Proceedings Title: Proceedings of SC13: International Conference for High Performance Computing, Networking, Storage and Analysis
URI: https://www10.informatik.uni-erlangen.de/Publications/Papers/2013/Godenschwager_SC13.pdf
waLBerla is a massively parallel software framework for simulating complex flows with the lattice Boltzmann method (LBM). Performance and scalability results are presented for SuperMUC, the world's fastest x86-based supercomputer ranked number 6 on the Top500 list, and JUQUEEN, a Blue Gene/Q system ranked as number 5. We reach resolutions with more than one trillion cells and perform up to 1.93 trillion cell updates per second using 1.8 million threads. The design and implementation of waLBerla is driven by a careful analysis of the performance on current petascale supercomputers. Our fully distributed data structures and algorithms allow for efficient, massively parallel simulations on these machines. Elaborate node level optimizations and vectorization using SIMD instructions result in highly optimized compute kernels for the single- and two-relaxation-time LBM. Excellent weak and strong scaling is achieved for a complex vascular geometry of the human coronary tree. Copyright 2013 ACM.
APA:
Godenschwager, C., Schornbaum, F., Bauer, M., Köstler, H., & Rüde, U. (2013). A Framework for Hybrid Parallel Flow Simulations with a Trillion Cells in Complex Geometries. In Proceedings of SC13: International Conference for High Performance Computing, Networking, Storage and Analysis (pp. 35-1 -- 35-12). Denver, US: IEEE Computer Society.
MLA:
Godenschwager, Christian, et al. "A Framework for Hybrid Parallel Flow Simulations with a Trillion Cells in Complex Geometries." Proceedings of SC13: International Conference for High Performance Computing, Networking, Storage and Analysis, Denver: IEEE Computer Society, 2013. 35-1 -- 35-12.