Heidorn C, Meyerhöfer N, Schinabeck C, Hannig F, Teich J (2022)
Publication Language: English
Publication Type: Conference contribution, Conference Contribution
Publication year: 2022
Publisher: Springer Nature
City/Town: Switzerland
Pages Range: 283 - 299
Event location: Pythagoreio, Samos
ISBN: 978-3-031-15073-9
DOI: 10.1007/978-3-031-15074-6_18
Compression techniques for Convolutional Neural Networks (CNNs) are key to performance. One common technique is filter pruning, which can effectively reduce the memory footprint, number of arithmetic operations, and consequently inference time. Recently, several approaches have been presented for automatic CNN compression using filter pruning, where the number of pruned filters is optimized by nature-inspired metaheuristics (e.g., artificial bee colony algorithms). However, these approaches focus on finding an optimal pruned network structure without considering the targeted device for CNN deployment. In this work, we show that the typical objective of reducing the number of operations does not necessarily lead to a maximum reduction in inference time, which is usually the main goal for compressing CNNs besides reducing the memory footprint. We then propose a hardware-aware multi-objective Design Space Exploration (DSE) technique for filter pruning that involves the targeted device (i.e., Graphics Processing Units (GPUs)). For each layer, the number of filters to be pruned is optimized with the objectives of minimizing the inference time and
the error rate of the CNN. Experimental results show that our approach can further speed up inference time by 1.24× and 1.09× for VGG-16 on the CIFAR-10 dataset and ResNet-101 on the ILSVRC-2012 dataset, respectively, compared to the state-of-the-art ABCPruner.
APA:
Heidorn, C., Meyerhöfer, N., Schinabeck, C., Hannig, F., & Teich, J. (2022). Hardware-Aware Evolutionary Filter Pruning. In Springer, Cham (Eds.), Proceedings of the International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation (SAMOS XXII) (pp. 283 - 299). Pythagoreio, Samos, GR: Switzerland: Springer Nature.
MLA:
Heidorn, Christian, et al. "Hardware-Aware Evolutionary Filter Pruning." Proceedings of the International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation (SAMOS XXII), Pythagoreio, Samos Ed. Springer, Cham, Switzerland: Springer Nature, 2022. 283 - 299.
BibTeX: Download