Supervised deep features for Software functional clone detection by exploiting lexical and syntactical information in source code

292Citations
Citations of this article
130Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Software clone detection, aiming at identifying out code fragments with similar functionalities, has played an important role in software maintenance and evolution. Many clone detection approaches have been proposed. However, most of them represent source codes with hand-crafted features using lexical or syntactical information, or unsuper-vised deep features, which makes it difficult to detect the functional clone pairs, i.e., pieces of codes with similar functionality but differing in both syntactical and lexical level. In this paper, we address the software functional clone detection problem by learning supervised deep features. We formulate the clone detection as a supervised learning to hash problem and propose an end-to-end deep feature learning framework called CDLH for functional clone detection. Such framework learns hash codes by exploiting the lexical and syntactical information for fast computation of functional similarity between code fragments. Experiments on software clone detection benchmarks indicate that the CDLH approach is effective and outperforms the state-of-the-art approaches in software functional clone detection.

Cite

CITATION STYLE

APA

Wei, H. H., & Li, M. (2017). Supervised deep features for Software functional clone detection by exploiting lexical and syntactical information in source code. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 0, pp. 3034–3040). International Joint Conferences on Artificial Intelligence. https://doi.org/10.24963/ijcai.2017/423

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free