Using Sparse Composite Document Vectors to Classify VBA Macros

5Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

To detect new macro malware, NLP-based detection methods have been proposed. These methods mainly use a Doc2vec model to represent the source code, which provides a vector space to classify malicious macros and benign ones. Recently, more sophisticated models outperform Doc2vec in performance and time complexity. However, there is no study to compare these language models for macro malware detection. In this paper, we focus on Sparse Composite Document Vectors (SCDV), which is a simple feature construction algorithm. To evaluate the performance for malware detection, we compare SCDV and other language models: Bag-of Words, Latent Semantic Indexing (LSI), Doc2vec. The experimental result with actual macro malware shows the most suitable language model for macro malware detection.

Cite

CITATION STYLE

APA

Mimura, M. (2019). Using Sparse Composite Document Vectors to Classify VBA Macros. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11928 LNCS, pp. 714–720). Springer. https://doi.org/10.1007/978-3-030-36938-5_46

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free