Malicious PDF files detection using structural and javascript based features

Sonal Dabral; Amit Agarwal; Manish Mahajan; Sachin Kumar

Conference Proceedings

Malicious PDF files detection using structural and javascript based features

Communications in Computer and Information Science (2017) 750 137-147

DOI: 10.1007/978-981-10-6544-6_14

3Citations

8Readers

Get full text

Abstract

Malicious PDF files recently considered one of the most dangerous threats to the system security. The flexible code-bearing vector of the PDF format enables to attacker to carry out malicious code on the computer system for user exploitation. Many solutions have been developed by security agents for the safety of user’s system, but still inadequate. In this paper, we propose a method for malicious PDF file detection via machine learning approach. The proposed method extract features from PDF file structure and embedded JavaScript code that leverage on advanced parsing mechanism. Instead of looking for the specific attack inside the content of PDF i.e. quite complex procedure, we extract features that are often used for attacks. Moreover, we present the experimental evidence for the choice of learning algorithm to provide the remarkably high accuracy as compared to other existing methods.

Author supplied keywords

Cite

CITATION STYLE

APA

Dabral, S., Agarwal, A., Mahajan, M., & Kumar, S. (2017). Malicious PDF files detection using structural and javascript based features. In Communications in Computer and Information Science (Vol. 750, pp. 137–147). Springer Verlag. https://doi.org/10.1007/978-981-10-6544-6_14

Malicious PDF files detection using structural and javascript based features

Abstract

Author supplied keywords

Cite

Register to see more suggestions