Sabih M, Hannig F, Teich J (2022)
Publication Language: English
Publication Type: Conference contribution, Conference Contribution
Publication year: 2022
Publisher: Association for Computing Machinery (ACM)
City/Town: New York, NY, United States
Pages Range: 109–115
Conference Proceedings Title: Proceedings of the 2nd European Workshop on Machine Learning and Systems (EuroMLSys)
Event location: Rennes, France
ISBN: 978-1-4503-9254-9
Filter pruning is one of the most effective ways to accelerate Convolutional Neural Networks (CNNs). Most of the existing works are focused on the static pruning of CNN filters. In dynamic pruning of CNN filters, existing works are based on the idea of switching between different branches of a CNN or exiting early based on the hardness of a sample. These approaches can reduce the average latency of inference, but they cannot reduce the longest-path latency of inference. In contrast, we present a novel approach of dynamic filter pruning that utilizes explainable AI along with early coarse prediction in the intermediate layers of a CNN. This coarse prediction is performed using a simple branch that is trained to perform top-k classification. The branch either predicts the output class with high confidence, in which case the rest of the computations are left out. Alternatively, the branch predicts the output class to be within a subset of possible output classes. After this coarse prediction, only those filters that are important for this subset of classes are then evaluated. The importances of filters for each output class are obtained using explainable AI. Using this concept of dynamic pruning, we are able not only to reduce the average latency of inference, but also the longest-path latency of inference. Our proposed architecture for dynamic pruning can be deployed on different hardware platforms.
APA:
Sabih, M., Hannig, F., & Teich, J. (2022). DyFiP: Explainable AI-based Dynamic Filter Pruning of Convolutional Neural Networks. In Proceedings of the 2nd European Workshop on Machine Learning and Systems (EuroMLSys) (pp. 109–115). Rennes, France, FR: New York, NY, United States: Association for Computing Machinery (ACM).
MLA:
Sabih, Muhammad, Frank Hannig, and Jürgen Teich. "DyFiP: Explainable AI-based Dynamic Filter Pruning of Convolutional Neural Networks." Proceedings of the 2nd European Workshop on Machine Learning and Systems (EuroMLSys), Rennes, France New York, NY, United States: Association for Computing Machinery (ACM), 2022. 109–115.
BibTeX: Download