Performance-Optimierung für zukünftige Hardware

Internally funded project

Project Details

Project leader:
Prof. Dr. Ulrich Rüde

Project members:
Prof. Dr. Harald Köstler

Contributing FAU Organisations:
Lehrstuhl für Informatik 10 (Systemsimulation)
Technische Fakultät

Start date: 01/06/2006

Abstract (technical / expert description):

To get high performance in numerical applications, characteristics of the executing platform have to be considered. The number of cores per CPU are likely to further increase, and heterogeneous, accelerated architectures to establish, but even reconfigurable units are possible. We therefore investigate and develop performance optimization techniques on modern and alternative hardware, like for instance multicore systems, GPGPUs, and other accelerator architectures.


Stürmer, M., Köstler, H., & Rüde, U. (2012). Fast wavelet transform utilizing a multicore-aware framework. In Jonasson K (Eds.), Applied Parallel and Scientific Computing. (pp. 313-323). Berlin, Heidelberg, New York: Springer-verlag.
Ritter, D., Stürmer, M., & Rüde, U. (2010). A fast-adaptive composite grid algorithm for solving the free-space Poisson problem on the cell broadband engine. Numerical Linear Algebra With Applications, 17(2-3), 291-305.
Bartuschat, D., Stürmer, M., & Köstler, H. (2010). An Orthogonal Matching Pursuit Algorithm for Image Denoising on the Cell Broadband Engine. In Parallel Processing and Applied Mathematics (pp. 557-566). Wroclaw: Berlin Heidelberg: Springer-verlag.
Stürmer, M., Köstler, H., & Rüde, U. (2010). Optimized fast wavelet transform utilizing a multicore-aware framework for stencil computations. In Proceedings of Para 2010 (pp. 1-4). Reykjavik, Iceland, IS.
Stürmer, M., Wellein, G., Hager, G., Köstler, H., & Rüde, U. (2009). Challenges and Potentials of Emerging Multicore Architectures. In High Performance Computing in Science and Engineering Garching-Munich 2007 (pp. 551-566). Garching: Berlin Heidelberg: Springer.
Stürmer, M., Götz, J., Richter, G., Dörfler, A., & Rüde, U. (2009). Fluid flow simulation on the Cell Broadband Engine using the lattice Boltzmann method. Computers and Mathematics with Applications, 58(5), 1062-1070.
Stürmer, M., Eitzinger, J., & Rüde, U. (2008). Optimising a 3D multigrid algorithm for the IA-64 architecture. International Journal of Computational Science and Engineering, 4(1), 29-35.

Last updated on 2019-15-03 at 10:31