Rodriguez Salas D, Mürschberger N, Ravikumar N, Seuret M, Maier A (2020)
Publication Language: English
Publication Type: Journal article, Original article
Publication year: 2020
Book Volume: 1
Article Number: 252
Journal Issue: 5
DOI: 10.1007/s42979-020-00268-y
Open Access Link: https://doi.org/10.1007/s42979-020-00268-y
Tree-based classifiers provide easy-to-understand outputs. Artificial neural networks (ANN) commonly outperform tree-based classifiers; nevertheless, understanding their outputs requires specialized knowledge in most cases. The highly redundant architecture of ANN is typically designed through an expensive trial-and-error scheme. We aim at (1) investigating whether using ensembles of decision trees to design the architecture of low-redundant, sparse ANN provides better-performing networks, and (2) evaluating whether such trees can be used to provide human-understandable explanations for their outputs. Information about the hierarchy of the features, and how good they are at separating subsets of samples among the classes, is gathered from each branch in an ensemble of trees. This information is used to design the architecture of a sparse multilayer perceptron network. Networks built using our method are called ForestNet. Tree branches corresponding to highly activated neurons are used to provide explanations of the networks’ outputs. ForestNets are able to handle low- and high-dimensional data, as we show on an evaluation using four datasets. Our networks consistently outperformed their respective ensemble of trees and had similar performance to their fully connected counterparts with a significant reduction of connections. Furthermore, our interpretation method seems to provide support for the ForestNet outputs. While ForestNet’s architectures do not allow them yet to capture well the intrinsic variability of visual data, they exhibit very promising results by reducing more than 98% of connections for such visual tasks. Structure similarities between ForestNets and their respective tree ensemble provide means to interpret their outputs.
APA:
Rodriguez Salas, D., Mürschberger, N., Ravikumar, N., Seuret, M., & Maier, A. (2020). Mapping Ensembles of Trees to Sparse, Interpretable Multilayer Perceptron Networks. SN Computer Science, 1(5). https://doi.org/10.1007/s42979-020-00268-y
MLA:
Rodriguez Salas, Dalia, et al. "Mapping Ensembles of Trees to Sparse, Interpretable Multilayer Perceptron Networks." SN Computer Science 1.5 (2020).
BibTeX: Download