Automatic Kernel Fusion for Image Processing DSLs

Qiao B, Reiche O, Hannig F, Teich J (2018)


Publication Language: English

Publication Type: Conference contribution, Original article

Publication year: 2018

Pages Range: 76-85

Conference Proceedings Title: Proceedings of the 21th International Workshop on Software and Compilers for Embedded Systems (SCOPES)

Event location: Sankt Goar DE

ISBN: 978-1-4503-5780-7

DOI: 10.1145/3207719.3207723

Abstract

Programming image processing algorithms on hardware accelerators such as graphics processing units (GPUs) often exhibits a trade-off between software portability and performance portability. Domain-specific languages (DSLs) have proven to be a promising remedy, which enable optimizations and generation of efficient code from a concise, high-level algorithm representation. The scope of this paper is an optimization framework for image processing DSLs in the form of a source-to-source compiler. To cope with the inter-kernel communication bound via global memory for GPU applications, kernel fusion is investigated as a primary optimization technique to improve temporal locality. In order to enable automatic kernel fusion, we analyze the fusibility of each kernel in the algorithm, in terms of data dependencies, resource utilization, and parallelism granularity. By combining the obtained information with the domain-specific knowledge captured in the DSL, a method to automatically fuse the suitable kernels is proposed and integrated into an open source DSL framework. The novel kernel fusion technique is evaluated on two filter-based image processing applications, for which speedups of up to 1.60 are obtained for an NVIDIA Geforce 745 graphics card target.

Authors with CRIS profile

Related research project(s)

How to cite

APA:

Qiao, B., Reiche, O., Hannig, F., & Teich, J. (2018). Automatic Kernel Fusion for Image Processing DSLs. In Proceedings of the 21th International Workshop on Software and Compilers for Embedded Systems (SCOPES) (pp. 76-85). Sankt Goar, DE.

MLA:

Qiao, Bo, et al. "Automatic Kernel Fusion for Image Processing DSLs." Proceedings of the 21st International Workshop on Software and Compilers for Embedded Systems (SCOPES), Sankt Goar 2018. 76-85.

BibTeX: Download