Automated source code generation is currently a popular machine-learning-based task. It can help software developers write functionally correct code from a given context. However, just like human developers, a code generation model can produce vulnerable code, which developers may mistakenly adopt. For this reason, evaluating the security of code generation models is essential. In this paper, we describe SecurityEval, an evaluation dataset for this purpose. It contains 130 samples covering 75 vulnerability types, which are mapped to the Common Weakness Enumeration (CWE). We also demonstrate using our dataset to evaluate one open-source code generation model (i.e., InCoder) and one closed-source model (i.e., GitHub Copilot).
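To illustrate the kind of evaluation loop the abstract describes, the minimal Python sketch below iterates over a SecurityEval-style dataset of CWE-tagged prompts and collects completions from a code generation model for later security analysis. The file name and the field names ("ID", "Prompt") are assumptions for illustration, not necessarily the published schema, and `model_complete` stands in for whichever model is being evaluated.

```python
import json


def load_samples(path="dataset.jsonl"):
    """Yield one sample per line from a JSON Lines file.

    Assumed format: each line is a JSON object pairing a CWE-tagged
    sample ID with a code-generation prompt.
    """
    with open(path, encoding="utf-8") as fh:
        for line in fh:
            if line.strip():
                yield json.loads(line)


def evaluate(model_complete, path="dataset.jsonl"):
    """Run every prompt through a model and keep the completions.

    model_complete: any callable mapping a prompt string to generated code.
    """
    results = []
    for sample in load_samples(path):
        completion = model_complete(sample["Prompt"])
        results.append({"ID": sample["ID"], "Completion": completion})
    # The collected completions would then be checked with a vulnerability
    # scanner (e.g., a static analyzer) to see whether the corresponding
    # CWE actually appears in the generated code.
    return results
```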
Citation: Siddiq, M. L., & Santos, J. C. S. (2022). SecurityEval dataset: Mining vulnerability examples to evaluate machine learning-based code generation techniques. In Proceedings of the 1st International Workshop on Mining Software Repositories Applications for Privacy and Security (MSR4P&S 2022), co-located with ESEC/FSE 2022 (pp. 29–33). Association for Computing Machinery. https://doi.org/10.1145/3549035.3561184