Efficiently measuring an accurate and generalized clone detection precision using clone clustering

2Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.

Abstract

An important measure of clone detection performance is precision. However, there has been a marked lack of research into methods of efficiently and accurately measuring the precision of a clone detection tool. Instead, tool authors simply validate a small random sample of the clones their tools detected in a subject software system. Since there could be many thousands of clones reported by the tool, such a small random sample cannot guarantee an accurate and generalized measure of the tool's precision for all the varieties of clones that can occur in any arbitrary software system. In this paper, we propose a machine-learning based approach that can cluster similar clones together, and which can be used to maximize the variety of clones examined when measuring precision, while significantly reducing the biases a specific subject system has on the generality of the precision measured. Our technique reduces the efforts in measuring precision, while doubling the variety of clones validated and reducing biases that harm the generality of the measure by up to an order of magnitude. Our case study with the NiCad clone detector and the Java class library shows that our approach is effective in efficiently measuring an accurate and generalized precision of a subject clone detection tool.

Cite

CITATION STYLE

APA

Svajlenko, J., & Roy, C. K. (2016). Efficiently measuring an accurate and generalized clone detection precision using clone clustering. In Proceedings of the International Conference on Software Engineering and Knowledge Engineering, SEKE (Vol. 2016-January, pp. 426–433). Knowledge Systems Institute Graduate School. https://doi.org/10.18293/SEKE2016-150

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free