Kreutzer P, Dotzler G, Ring M, Eskofier B, Philippsen M (2016)
Publication Language: English
Publication Type: Conference contribution, Original article
Publication year: 2016
Pages Range: 61-72
Conference Proceedings Title: Proceedings of the 13th International Conference on Mining Software Repositories (MSR'16)
Event location: Austin, TX, USA
ISBN: 978-1-4503-4186-8
URI: http://dl.acm.org/citation.cfm?id=2901749
Several research tools and projects require groups of similar code changes as input. Examples are recommendation and bug finding tools that can provide valuable information to developers based on such data. With the help of similar code changes they can simplify the application of bug fixes and code changes to multiple locations in a project. But despite their benefit, the practical value of existing tools is limited, as users need to manually specify the input data, i.e., the groups of similar code changes.
To overcome this drawback, this paper presents and evaluates two syntactical similarity metrics, one of them is specifically designed to run fast, in combination with two carefully selected and self-tuning clustering algorithms to automatically detect groups of similar code changes.
We evaluate the combinations of metrics and clustering algorithms by applying them to several open source projects and also publish the detected groups of similar code changes online as a reference dataset. The automatically detected groups of similar code changes work well when used as input for LASE, a recommendation system for code changes.
APA:
Kreutzer, P., Dotzler, G., Ring, M., Eskofier, B., & Philippsen, M. (2016). Automatic clustering of code changes. In Proceedings of the 13th International Conference on Mining Software Repositories (MSR'16) (pp. 61-72). Austin, TX, USA, US.
MLA:
Kreutzer, Patrick, et al. "Automatic clustering of code changes." Proceedings of the 13th International Conference on Mining Software Repositories (MSR 2016), Austin, TX, USA 2016. 61-72.
BibTeX: Download