On the Shoulders of Giants: A New Dataset for Pull-based Development Research

Abstract

Pull-based development is a widely adopted paradigm for collaboration in distributed software development and has attracted attention from both academia and industry. To better support the study of the pull-based development model, this paper presents a new dataset containing 96 features collected from 11,230 projects and 3,347,937 pull requests. We describe the creation process and explain the features in detail. To the best of our knowledge, our dataset is the largest and most comprehensive to date, a step toward a complete picture of pull-based development research.

Cite

Zhang, X., Rastogi, A., & Yu, Y. (2020). On the Shoulders of Giants: A New Dataset for Pull-based Development Research. In Proceedings - 2020 IEEE/ACM 17th International Conference on Mining Software Repositories, MSR 2020 (pp. 543–547). Association for Computing Machinery, Inc. https://doi.org/10.1145/3379597.3387489
