Functional code clone detection with syntax and semantics fusion learning

Chunrong Fang; Zixi Liu; Yangyang Shi; Jeff Huang; Qingkai Shi

Conference ProceedingsOPEN ACCESS

Functional code clone detection with syntax and semantics fusion learning

ISSTA 2020 - Proceedings of the 29th ACM SIGSOFT International Symposium on Software Testing and Analysis (2020) 516-527

DOI: 10.1145/3395363.3397362

132Citations

97Readers

Get full text

Abstract

Clone detection of source code is among the most fundamental software engineering techniques. Despite intensive research in the past decade, existing techniques are still unsatisfactory in detecting "functional" code clones. In particular, existing techniques cannot efficiently extract syntax and semantics information from source code. In this paper, we propose a novel joint code representation that applies fusion embedding techniques to learn hidden syntactic and semantic features of source codes. Besides, we introduce a new granularity for functional code clone detection. Our approach regards the connected methods with caller-callee relationships as a functionality and the method without any caller-callee relationship with other methods represents a single functionality. Then we train a supervised deep learning model to detect functional code clones. We conduct evaluations on a large dataset of C++ programs and the experimental results show that fusion learning can significantly outperform the state-of-the-art techniques in detecting functional code clones.

Author supplied keywords

Cite

CITATION STYLE

APA

Fang, C., Liu, Z., Shi, Y., Huang, J., & Shi, Q. (2020). Functional code clone detection with syntax and semantics fusion learning. In ISSTA 2020 - Proceedings of the 29th ACM SIGSOFT International Symposium on Software Testing and Analysis (pp. 516–527). Association for Computing Machinery, Inc. https://doi.org/10.1145/3395363.3397362

Functional code clone detection with syntax and semantics fusion learning

Abstract

Author supplied keywords

Cite

Register to see more suggestions