Who wrote this code? Identifying the authors of program binaries

71Citations
Citations of this article
127Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Program authorship attribution - identifying a programmer based on stylistic characteristics of code - has practical implications for detecting software theft, digital forensics, and malware analysis. Authorship attribution is challenging in these domains where usually only binary code is available; existing source code-based approaches to attribution have left unclear whether and to what extent programmer style survives the compilation process. Casting authorship attribution as a machine learning problem, we present a novel program representation and techniques that automatically detect the stylistic features of binary code. We apply these techniques to two attribution problems: identifying the precise author of a program, and finding stylistic similarities between programs by unknown authors. Our experiments provide strong evidence that programmer style is preserved in program binaries. © 2011 Springer-Verlag.

Cite

CITATION STYLE

APA

Rosenblum, N., Zhu, X., & Miller, B. P. (2011). Who wrote this code? Identifying the authors of program binaries. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6879 LNCS, pp. 172–189). Springer Verlag. https://doi.org/10.1007/978-3-642-23822-2_10

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free