Kerncraft: A Tool for Analytic Performance Modeling of Loop Kernels

Beitrag bei einer Tagung
(Konferenzbeitrag)


Details zur Publikation

Autorinnen und Autoren: Hammer J, Eitzinger J, Hager G, Wellein G
Herausgeber: Niethammer C, Gracia J, Hilbrich T, Knüpfer A, Resch MM, Nagel WE
Verlag: Springer International Publishing
Verlagsort: Cham
Jahr der Veröffentlichung: 2017
Tagungsband: Tools for High Performance Computing 2016
Seitenbereich: 1--22
ISBN: 978-3-319-56702-0


Abstract

Achieving optimal program performance requires deep insight into the interaction between hardware and software. For software developers without an in-depth background in computer architecture, understanding and fully utilizing modern architectures is close to impossible. Analytic loop performance modeling is a useful way to understand the relevant bottlenecks of code execution based on simple machine models. The Roofline Model and the Execution-Cache-Memory (ECM) model are proven approaches to performance modeling of loop nests. In comparison to the Roofline model, the ECM model can also describes the single-core performance and saturation behavior on a multicore chip.We give an introduction to the Roofline and ECM models, and to stencil performance modeling using layer conditions (LC). We then present Kerncraft, a tool that can automatically construct Roofline and ECM models for loop nests by performing the required code, data transfer, and LC analysis. The layer condition analysis allows to predict optimal spatial blocking factors for loop nests. Together with the models it enables an ab-initio estimate of the potential benefits of loop blocking optimizations and of useful block sizes. In cases where LC analysis is not easily possible, Kerncraft supports a cache simulator as a fallback option. Using a 25-point long-range stencil we demonstrate the usefulness and predictive power of the Kerncraft tool.


FAU-Autorinnen und Autoren / FAU-Herausgeberinnen und Herausgeber

Eitzinger, Jan Dr.
Regionales Rechenzentrum Erlangen (RRZE)
Hager, Georg Dr.
Regionales Rechenzentrum Erlangen (RRZE)
Hammer, Julian
Professur für Höchstleistungsrechnen
Wellein, Gerhard Prof. Dr.
Professur für Höchstleistungsrechnen


Forschungsbereiche

Werkzeuge für Performancemodellierung und Performanceanalyse
Professur für Höchstleistungsrechnen


Zitierweisen

APA:
Hammer, J., Eitzinger, J., Hager, G., & Wellein, G. (2017). Kerncraft: A Tool for Analytic Performance Modeling of Loop Kernels. In Niethammer C, Gracia J, Hilbrich T, Knüpfer A, Resch MM, Nagel WE (Eds.), Tools for High Performance Computing 2016 (pp. 1--22). Stuttgart, Germany: Cham: Springer International Publishing.

MLA:
Hammer, Julian, et al. "Kerncraft: A Tool for Analytic Performance Modeling of Loop Kernels." Proceedings of the 10th International Workshop on Parallel Tools for High Performance Computing, Stuttgart, Germany Ed. Niethammer C, Gracia J, Hilbrich T, Knüpfer A, Resch MM, Nagel WE, Cham: Springer International Publishing, 2017. 1--22.

BibTeX: 

Zuletzt aktualisiert 2018-19-08 um 16:38