Recommending Good First Issues in GitHub OSS Projects

21Citations
Citations of this article
42Readers
Mendeley users who have this article in their library.

Abstract

Attracting and retaining newcomers is vital for the sustainability of an open-source software project. However, it is difficult for new-comers to locate suitable development tasks, while existing 'Good First Issues' (GFI) in GitHub are often insufficient and inappropriate. In this paper, we propose RECGFI, an effective practical approach for the recommendation of good first issues to newcomers, which can be used to relieve maintainers' burden and help newcomers onboard. RECGFI models an issue with features from multiple dimensions (content, background, and dynamics) and uses an XGBoost classifier to generate its probability of being a GFI. To evaluate RECGFI, we collect 53,510 resolved issues among 100 GitHub projects and care-fully restore their historical states to build ground truth datasets. Our evaluation shows that RECGFI can achieve up to 0.853 AUC in the ground truth dataset and outperforms alternative models. Our interpretable analysis of the trained model further reveals in-teresting observations about GFI characteristics. Finally, we report latest issues (without GFI-signaling labels but recommended as GFI by our approach) to project maintainers among which 16 are confirmed as real GFIs and five have been resolved by a newcomer.

References Powered by Scopus

Long Short-Term Memory

76931Citations
N/AReaders
Get full text

XGBoost: A scalable tree boosting system

32564Citations
N/AReaders
Get full text

The meaning and use of the area under a receiver operating characteristic (ROC) curve

17820Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Code Recommendation for Open Source Software Developers

8Citations
N/AReaders
Get full text

Quality Assurance Awareness in Open Source Software Projects on GitHub

6Citations
N/AReaders
Get full text

GFI-bot: automated good first issue recommendation on GitHub

6Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Xiao, W., He, H., Xu, W., Tan, X., Dong, J., & Zhou, M. (2022). Recommending Good First Issues in GitHub OSS Projects. In Proceedings - International Conference on Software Engineering (Vol. 2022-May, pp. 1830–1842). IEEE Computer Society. https://doi.org/10.1145/3510003.3510196

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 14

78%

Lecturer / Post doc 2

11%

Professor / Associate Prof. 1

6%

Researcher 1

6%

Readers' Discipline

Tooltip

Computer Science 10

53%

Neuroscience 6

32%

Engineering 2

11%

Chemistry 1

5%

Save time finding and organizing research with Mendeley

Sign up for free