MalwareTextDB: A database for annotated malware articles

Abstract

Cybersecurity risks and malware threats are becoming increasingly dangerous and common. Despite the severity of the problem, there have been few NLP efforts focused on tackling cybersecurity. In this paper, we discuss the construction of a new database for annotated malware texts. An annotation framework is introduced based on the MAEC vocabulary for defining malware characteristics, along with a database consisting of 39 annotated APT reports with a total of 6,819 sentences. We also use the database to construct models that can potentially help cybersecurity researchers in their data collection and analytics efforts.

Citation (APA)

Lim, S. K., Muis, A. O., Lu, W., & Ong, C. H. (2017). MalwareTextDB: A database for annotated malware articles. In ACL 2017 - 55th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers) (Vol. 1, pp. 1557–1567). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/P17-1143
