Learning Types for Binaries

Zhiwu Xu; Cheng Wen; Shengchao Qin

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10610 LNCS 430-446

DOI: 10.1007/978-3-319-68690-5_26

Abstract

Type inference for binary code is a challenging problem, partly because much type-related information is lost during compilation from high-level source code. Most existing research on binary code type inference resorts to program analysis techniques, which can be too conservative to infer types with high accuracy or too heavyweight to be viable in practice. In this paper, we propose a new approach to learning types for recovered variables from their related representative instructions. Our idea is motivated by “duck typing”, where the type of a variable is determined by its features and properties. Our approach first learns a classifier from existing binaries with debug information and then uses this classifier to predict types for new, unseen binaries. We have implemented our approach in a tool called BITY and used it to conduct experiments on the well-known benchmark coreutils (v8.4). The results show that our tool is more precise than the commercial tool Hex-Rays, both in terms of correct types and compatible types.
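The learn-then-predict idea in the abstract can be illustrated with a toy sketch. This is not BITY's implementation: the feature encoding (a bag of instruction tokens per variable), the nearest-neighbour classifier, the training samples, and the width-suffixed tokens such as `mov_32` are all assumptions made up for illustration. The abstract does not say which learner or features the tool uses.

```python
from collections import Counter
import math

# Hypothetical training set: each recovered variable is described by the
# instructions that touch it, labelled with the type taken from debug info.
# Tokens like "mov_32" are invented here to encode operand width.
TRAINING = [
    (["movss", "addss", "mulss"], "float"),
    (["movsd", "addsd", "cvtsi2sd"], "double"),
    (["mov_32", "add_32", "cmp_32"], "int"),
    (["movzx_8", "cmp_8", "mov_8"], "char"),
    (["mov_64", "lea_64", "deref_64"], "pointer"),
]


def features(instructions):
    """Turn a variable's related instructions into a bag-of-tokens vector."""
    return Counter(instructions)


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def train(samples):
    """'Learning' here just stores one feature vector per labelled variable."""
    return [(features(ins), label) for ins, label in samples]


def predict(model, instructions):
    """Return the label of the most similar training variable (1-NN)."""
    query = features(instructions)
    _, label = max(model, key=lambda entry: cosine(query, entry[0]))
    return label


model = train(TRAINING)
print(predict(model, ["movss", "mulss", "movss"]))
print(predict(model, ["mov_64", "lea_64"]))
```

The point of the sketch is the duck-typing analogy: a variable that is only ever touched by single-precision floating-point instructions behaves like a `float`, so it is labelled as one, without any constraint solving or data-flow analysis.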

Xu, Z., Wen, C., & Qin, S. (2017). Learning Types for Binaries. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10610 LNCS, pp. 430–446). Springer Verlag. https://doi.org/10.1007/978-3-319-68690-5_26