A new N-gram feature extraction-selection method for malicious code

Hamid Parvin; Behrouz Minaei; Hossein Karshenas; Akram Beigi

Conference Proceedings

A new N-gram feature extraction-selection method for malicious code

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2011) 6594 LNCS(PART 2) 98-107

DOI: 10.1007/978-3-642-20267-4_11

18Citations

16Readers

Get full text

Abstract

N-grams are the basic features commonly used in sequence-based malicious code detection methods in computer virology research. The empirical results from previous works suggest that, while short length n-grams are easier to extract, the characteristics of the underlying executables are better represented in lengthier n-grams. However, by increasing the length of an n-gram, the feature space grows in an exponential manner and much space and computational resources are demanded. And therefore, feature selection has turned to be the most challenging step in establishing an accurate detection system based on byte n-grams. In this paper we propose an efficient feature extraction method where in order to gain more information; both adjacent and non-adjacent bi-grams are used. Additionally, we present a novel boosting feature selection method based on genetic algorithm. Our experimental results indicate that the proposed detection system detects virus programs far more accurately than the best earlier known methods. © 2011 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Parvin, H., Minaei, B., Karshenas, H., & Beigi, A. (2011). A new N-gram feature extraction-selection method for malicious code. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6594 LNCS, pp. 98–107). https://doi.org/10.1007/978-3-642-20267-4_11

A new N-gram feature extraction-selection method for malicious code

Abstract

Author supplied keywords

Cite

Register to see more suggestions