Evaluating fuzz testing

Citations: 491
Readers: 373 (Mendeley users who have this article in their library)

Abstract

Fuzz testing has enjoyed great success at discovering security-critical bugs in real software. Recently, researchers have devoted significant effort to devising new fuzzing techniques, strategies, and algorithms. Such new ideas are primarily evaluated experimentally, so an important question is: What experimental setup is needed to produce trustworthy results? We surveyed the recent research literature and assessed the experimental evaluations carried out by 32 fuzzing papers. We found problems in every evaluation we considered. We then performed our own extensive experimental evaluation using an existing fuzzer. Our results showed that the general problems we found in existing experimental evaluations can indeed translate to actual wrong or misleading assessments. We conclude with some guidelines that we hope will help improve experimental evaluations of fuzz testing algorithms, making reported results more robust.
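
A central guideline from the paper is that fuzzer comparisons should rest on many independent trials judged with a statistical test (the authors recommend the Mann-Whitney U test) together with an effect size such as Vargha-Delaney A12. The sketch below illustrates what such a comparison looks like in practice; the crash counts are hypothetical illustration data, and SciPy is assumed to be available.

```python
# Minimal sketch of the statistical comparison the paper advocates.
# The unique-crash counts below are hypothetical, not from the paper.
from scipy.stats import mannwhitneyu

def vargha_delaney_a12(xs, ys):
    """Vargha-Delaney A12 effect size: probability that a random trial
    of fuzzer A outperforms a random trial of fuzzer B (0.5 = no effect)."""
    gt = sum(1 for x in xs for y in ys if x > y)
    eq = sum(1 for x in xs for y in ys if x == y)
    return (gt + 0.5 * eq) / (len(xs) * len(ys))

# Unique crashes found in, say, 10 independent 24-hour trials per fuzzer.
fuzzer_a = [12, 15, 9, 14, 13, 11, 16, 10, 12, 14]
fuzzer_b = [10, 11, 8, 12, 9, 10, 13, 9, 11, 10]

stat, p = mannwhitneyu(fuzzer_a, fuzzer_b, alternative="two-sided")
print(f"Mann-Whitney U = {stat}, p = {p:.4f}")
print(f"A12 = {vargha_delaney_a12(fuzzer_a, fuzzer_b):.2f}")
```

An A12 near 0.5 means the two fuzzers are indistinguishable on this measure, while values toward 1.0 favor fuzzer A. Reporting both a p-value and an effect size across many trials, rather than a single best-of-N number, is the kind of practice the paper's guidelines call for.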

Citation (APA)

Klees, G., Ruef, A., Cooper, B., Wei, S., & Hicks, M. (2018). Evaluating fuzz testing. In Proceedings of the ACM Conference on Computer and Communications Security (pp. 2123–2138). Association for Computing Machinery. https://doi.org/10.1145/3243734.3243804

Readers over time: [chart of annual Mendeley reader counts, 2018–2025]

Readers' Seniority

PhD / Post grad / Masters / Doc: 167 (78%)
Researcher: 28 (13%)
Professor / Associate Prof.: 12 (6%)
Lecturer / Post doc: 6 (3%)

Readers' Discipline

Computer Science: 219 (93%)
Engineering: 12 (5%)
Mathematics: 3 (1%)
Design: 2 (1%)

Article Metrics

News Mentions: 1
