In this paper, we analyze current methods to distinguish malware from benign software using Machine Learning (ML) and feature engineering techniques that have been implemented in recent years. Moreover, we build a new dataset based on API calls gathered from software analysis, conforming more than 30000 samples belonging to malware as well as benign software. Finally, we test this dataset with an existing model that achieves accuracy rates close to 97% with a different, smaller dataset, identifying interesting results that can open new research opportunities in this field.
CITATION STYLE
Torres, M., Álvarez, R., & Cazorla, M. (2023). Improving Malware Detection with a Novel Dataset Based on API Calls. In Lecture Notes in Networks and Systems (Vol. 531 LNNS, pp. 289–298). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-18050-7_28
Mendeley helps you to discover research relevant for your work.