Pull-based development is a widely adopted paradigm for collaboration in distributed software development, attracting eyeballs from both academic and industry. To better study pull-based development model, this paper presents a new dataset containing 96 features collected from 11,230 projects and 3,347,937 pull requests. We describe the creation process and explain the features in details. To the best of our knowledge, our dataset is the most comprehensive and largest one toward a complete picture for pull-based development research.
CITATION STYLE
Zhang, X., Rastogi, A., & Yu, Y. (2020). On the Shoulders of Giants: A New Dataset for Pull-based Development Research. In Proceedings - 2020 IEEE/ACM 17th International Conference on Mining Software Repositories, MSR 2020 (pp. 543–547). Association for Computing Machinery, Inc. https://doi.org/10.1145/3379597.3387489
Mendeley helps you to discover research relevant for your work.