Automatic Kernel Fusion for Image Processing DSLs

Beitrag bei einer Tagung
(Originalarbeit)


Details zur Publikation

Autor(en): Qiao B, Reiche O, Hannig F, Teich J
Jahr der Veröffentlichung: 2018
Tagungsband: Proceedings of the 21th International Workshop on Software and Compilers for Embedded Systems
Seitenbereich: 76-85
ISBN: 978-1-4503-5780-7
Sprache: Englisch


Abstract


Programming image processing algorithms on hardware acceler-


ators such as graphics processing units (GPUs) often exhibits a


trade-off between software portability and performance portability.


Domain-specific languages (DSLs) have proven to be a promising


remedy, which enable optimizations and generation of efficient


code from a concise, high-level algorithm representation.


The scope of this paper is an optimization framework for image


processing DSLs in the form of a source-to-source compiler. To cope


with the inter-kernel communication bound via global memory for


GPU applications, kernel fusion is investigated as a primary opti-


mization technique to improve temporal locality. In order to enable


automatic kernel fusion, we analyze the fusibility of each kernel in


the algorithm, in terms of data dependencies, resource utilization,


and parallelism granularity. By combining the obtained informa-


tion with the domain-specific knowledge captured in the DSL, a


method to automatically fuse the suitable kernels is proposed and


integrated into an open source DSL framework. The novel kernel


fusion technique is evaluated on two filter-based image processing


applications, for which speedups of up to 1.60 are obtained for an


NVIDIA Geforce 745 graphics card target.


FAU-Autoren / FAU-Herausgeber

Hannig, Frank PD Dr.-Ing.
Lehrstuhl für Informatik 12 (Hardware-Software-Co-Design)
Qiao, Bo
Lehrstuhl für Informatik 12 (Hardware-Software-Co-Design)
Reiche, Oliver
Lehrstuhl für Informatik 12 (Hardware-Software-Co-Design)
Teich, Jürgen Prof. Dr.-Ing.
Lehrstuhl für Informatik 12 (Hardware-Software-Co-Design)


Zitierweisen

APA:
Qiao, B., Reiche, O., Hannig, F., & Teich, J. (2018). Automatic Kernel Fusion for Image Processing DSLs. In Proceedings of the 21th International Workshop on Software and Compilers for Embedded Systems (pp. 76-85). Sankt Goar, DE.

MLA:
Qiao, Bo, et al. "Automatic Kernel Fusion for Image Processing DSLs." Proceedings of the 21st International Workshop on Software and Compilers for Embedded Systems (SCOPES 2018), Sankt Goar 2018. 76-85.

BibTeX: 

Zuletzt aktualisiert 2018-16-08 um 18:42