The Commit Size Distribution of Open Source Software

Arafat O, Riehle D (2009)


Publication Type: Conference contribution, Original article

Publication year: 2009

Publisher: IEEE Press

Edited Volumes: Proceedings of the 42nd Annual Hawaii International Conference on System Sciences, HICSS

Pages Range: 1-8

Conference Proceedings Title: Proceedings of the 42nd Hawaiian International Conference on System Sciences

Event location: Hawaii

URI: http://dirkriehle.com/wp-content/uploads/2008/09/hicss-42-csdistr-final-web.pdf

DOI: 10.1109/HICSS.2009.421

Abstract

With the growing economic importance of open source, we need to improve our understanding of how open source software development processes work. The analysis of code contributions to open source projects is an important part of such research. In this paper we analyze the size of code contributions to more than 9,000 open source projects. We review the total distribution and distinguish three categories of code contributions using a size-based heuristic: single focused commits, aggregate team contributions, and repository refactorings. We find that both the overall distribution and the individual categories follow a power law. We also suggest that distinguishing these commit categories by size will benefit future analyses. © 2009 IEEE.

Authors with CRIS profile

How to cite

APA:

Arafat, O., & Riehle, D. (2009). The Commit Size Distribution of Open Source Software. In Proceedings of the 42nd Hawaiian International Conference on System Sciences (pp. 1-8). Hawaii: IEEE Press.

MLA:

Arafat, Oliver, and Dirk Riehle. "The Commit Size Distribution of Open Source Software." Proceedings of the HICSS-42, Hawaii IEEE Press, 2009. 1-8.

BibTeX: Download