Sigl M (2019)
Publication Language: English
Publication Status: Accepted
Publication Type: Conference contribution, Conference Contribution
Future Publication Type: Conference contribution
Publication year: 2019
Series: 35th IEEE International Conference on Data Engineering, ICDE 2019, Macau SAR, China, April 8 - 11, 2019
Edition: 35
Event location: Macau SAR, China
Training of Deep-Learning models represents a computationally intensive task, especially with high-dimensional and high-volume data, where training may take days or weeks.
Current research mainly focuses on model materialization, workflow optimization, and lifecylce management. However, searching for a similar dataset in a dataset repository to reuse a suitable pre-built model has received little attention by the data-management community.
A remarkable feature of reusing an already built model is that the iterative process of finding a suitable model is shortened, and additionally, only a small set of training data is required. Thus, transferring such knowledge has become an important issue in the Deep-Learning community. However, the relationship between the source and target task is not well understood yet.
The aim of this research is to reduce training time of machine learning from a data-management perspective through model reuse, and shed some light on the above relationship in the case when reusing a model is appropriate. We propose a framework to aid data scientists in searching for a similar dataset given a new dataset and selecting a suitable model. We discuss a novel approach on how to determine similarity among datasets and how to chose an appropriate model. We are confident that with recent advances of model materialization and dataset repositories our research improves the knowledge on this topic.
APA:
Sigl, M. (2019). Don't Fear the REAPER: A Framework for Materializing and Reusing Deep-Learning Models. In IEEE Computer Society (Eds.), Proceedings of the International Conference on Data Engineering. Macau SAR, China, MO.
MLA:
Sigl, Melanie. "Don't Fear the REAPER: A Framework for Materializing and Reusing Deep-Learning Models." Proceedings of the International Conference on Data Engineering, Macau SAR, China Ed. IEEE Computer Society, 2019.
BibTeX: Download