Using data mining for static code analysis of C

Hannes Tribus; Irene Morrigl; Stefan Axelsson

Conference Proceedings

Using data mining for static code analysis of C

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2012) 7713 LNAI 603-614

DOI: 10.1007/978-3-642-35527-1_50

3Citations

13Readers

Get full text

Abstract

Static analysis of source code is one way to find bugs and problems in large software projects. Many approaches to static analysis have been proposed. We proposed a novel way of performing static analysis. Instead of methods based on semantic/logic analysis we apply machine learning directly to the problem. This has many benefits. Learning by example means trivial programmer adaptability (a problem with many other approaches), learning systems also has the advantage to be able to generalise and find problematic source code constructs that are not exactly as the programmer initially thought, to name a few. Due to the general interest in code quality and the availability of large open source code bases as test and development data, we believe this problem should be of interest to the larger data mining community. In this work we extend our previous approach and investigate a new way of doing feature selection and test the suitability of many different learning algorithms. This on a selection of problems we adapted from large publicly available open source projects. Many algorithms were much more successful than our previous proof-of-concept, and deliver practical levels of performance. This is clearly an interesting and minable problem. © Springer-Verlag 2012.

Author supplied keywords

Cite

CITATION STYLE

APA

Tribus, H., Morrigl, I., & Axelsson, S. (2012). Using data mining for static code analysis of C. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7713 LNAI, pp. 603–614). https://doi.org/10.1007/978-3-642-35527-1_50

Using data mining for static code analysis of C

Abstract

Author supplied keywords

Cite

Register to see more suggestions