Co-Training for Commit Classification

6Citations
Citations of this article
47Readers
Mendeley users who have this article in their library.

Abstract

Commits in version control systems (e.g. Git) track changes in a software project. Commits comprise noisy user-generated natural language and code patches. Automatic commit classification (CC) has been used to determine the type of code maintenance activities performed, as well as to detect bug fixes in code repositories. Much prior work occurs in the fully-supervised setting – a setting that can be a stretch in resource-scarce situations presenting difficulties in labeling commits. In this paper, we apply co-training, a semi-supervised learning method, to take advantage of the two views available – the commit message (natural language) and the code changes (programming language) – to improve commit classification.

Cite

CITATION STYLE

APA

Yi, J., Lee, D., & Chieu, H. L. (2021). Co-Training for Commit Classification. In W-NUT 2021 - 7th Workshop on Noisy User-Generated Text, Proceedings of the Conference (pp. 389–395). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.wnut-1.43

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free