Hardware-Aware Evolutionary Filter Pruning

Heidorn C, Meyerhöfer N, Schinabeck C, Hannig F, Teich J (2022)


Publication Language: English

Publication Type: Conference contribution

Publication year: 2022

Publisher: Springer Nature

City/Town: Switzerland

Pages Range: 283 - 299

Event location: Pythagoreio, Samos GR

ISBN: 978-3-031-15073-9

DOI: 10.1007/978-3-031-15074-6_18

Abstract

Compression techniques for Convolutional Neural Networks (CNNs) are key to performance. One common technique is filter pruning, which can effectively reduce the memory footprint, number of arithmetic operations, and consequently inference time. Recently, several approaches have been presented for automatic CNN compression using filter pruning, where the number of pruned filters is optimized by nature-inspired metaheuristics (e.g., artificial bee colony algorithms). However, these approaches focus on finding an optimal pruned network structure without considering the targeted device for CNN deployment. In this work, we show that the typical objective of reducing the number of operations does not necessarily lead to a maximum reduction in inference time, which is usually the main goal for compressing CNNs besides reducing the memory footprint. We then propose a hardware-aware multi-objective Design Space Exploration (DSE) technique for filter pruning that involves the targeted device (i.e., Graphics Processing Units (GPUs)). For each layer, the number of filters to be pruned is optimized with the objectives of minimizing the inference time and the error rate of the CNN. Experimental results show that our approach can further speed up inference time by 1.24× and 1.09× for VGG-16 on the CIFAR-10 dataset and ResNet-101 on the ILSVRC-2012 dataset, respectively, compared to the state-of-the-art ABCPruner.
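
The two objectives named in the abstract (inference time and error rate) can be illustrated with a minimal Pareto-filtering sketch. This is not the authors' implementation: the candidate encoding, the names `Candidate`, `dominates`, and `pareto_front`, and all numbers are hypothetical placeholders. In a hardware-aware setting, the time value would presumably come from measurements on the target GPU rather than from an operation count.

```python
from typing import List, Tuple

# Hypothetical encoding: (filters kept per layer, inference time in ms, error rate).
# All values below are made-up placeholders, not results from the paper.
Candidate = Tuple[Tuple[int, ...], float, float]


def dominates(a: Candidate, b: Candidate) -> bool:
    """True if a is no worse than b in both objectives and strictly better in one."""
    no_worse = a[1] <= b[1] and a[2] <= b[2]
    strictly_better = a[1] < b[1] or a[2] < b[2]
    return no_worse and strictly_better


def pareto_front(candidates: List[Candidate]) -> List[Candidate]:
    """Keep only the candidates that no other candidate dominates."""
    return [
        c for c in candidates
        if not any(dominates(other, c) for other in candidates)
    ]


candidates: List[Candidate] = [
    ((64, 128, 256), 10.0, 0.070),
    ((48, 96, 256), 8.0, 0.075),
    ((48, 128, 192), 9.0, 0.080),  # slower and less accurate than the second entry
    ((32, 64, 128), 5.0, 0.120),
]

front = pareto_front(candidates)
for filters, time_ms, error in front:
    print(filters, time_ms, error)
```

An evolutionary search would repeatedly generate new per-layer filter counts, evaluate both objectives, and retain the non-dominated configurations; only the retention step is shown here.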

How to cite

APA:

Heidorn, C., Meyerhöfer, N., Schinabeck, C., Hannig, F., & Teich, J. (2022). Hardware-Aware Evolutionary Filter Pruning. In Proceedings of the International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation (SAMOS XXII) (pp. 283-299). Springer Nature.

MLA:

Heidorn, Christian, et al. "Hardware-Aware Evolutionary Filter Pruning." Proceedings of the International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation (SAMOS XXII), Pythagoreio, Samos, Springer Nature, 2022, pp. 283-299.
