DyFiP: Explainable AI-based Dynamic Filter Pruning of Convolutional Neural Networks

Sabih M, Hannig F, Teich J (2022)


Publication Language: English

Publication Type: Conference contribution, Conference Contribution

Publication year: 2022

Publisher: Association for Computing Machinery (ACM)

City/Town: New York, NY, United States

Pages Range: 109–115

Conference Proceedings Title: Proceedings of the 2nd European Workshop on Machine Learning and Systems (EuroMLSys)

Event location: Rennes, France FR

ISBN: 978-1-4503-9254-9

DOI: 10.1145/3517207.3526982

Abstract

Filter pruning is one of the most effective ways to accelerate Convolutional Neural Networks (CNNs). Most of the existing works are focused on the static pruning of CNN filters. In dynamic pruning of CNN filters, existing works are based on the idea of switching between different branches of a CNN or exiting early based on the hardness of a sample. These approaches can reduce the average latency of inference, but they cannot reduce the longest-path latency of inference. In contrast, we present a novel approach of dynamic filter pruning that utilizes explainable AI along with early coarse prediction in the intermediate layers of a CNN. This coarse prediction is performed using a simple branch that is trained to perform top-k classification. The branch either predicts the output class with high confidence, in which case the rest of the computations are left out. Alternatively, the branch predicts the output class to be within a subset of possible output classes. After this coarse prediction, only those filters that are important for this subset of classes are then evaluated. The importances of filters for each output class are obtained using explainable AI. Using this concept of dynamic pruning, we are able not only to reduce the average latency of inference, but also the longest-path latency of inference. Our proposed architecture for dynamic pruning can be deployed on different hardware platforms.

Authors with CRIS profile

Related research project(s)

How to cite

APA:

Sabih, M., Hannig, F., & Teich, J. (2022). DyFiP: Explainable AI-based Dynamic Filter Pruning of Convolutional Neural Networks. In Proceedings of the 2nd European Workshop on Machine Learning and Systems (EuroMLSys) (pp. 109–115). Rennes, France, FR: New York, NY, United States: Association for Computing Machinery (ACM).

MLA:

Sabih, Muhammad, Frank Hannig, and Jürgen Teich. "DyFiP: Explainable AI-based Dynamic Filter Pruning of Convolutional Neural Networks." Proceedings of the 2nd European Workshop on Machine Learning and Systems (EuroMLSys), Rennes, France New York, NY, United States: Association for Computing Machinery (ACM), 2022. 109–115.

BibTeX: Download