Malware classification using XGboost-Gradient boosted decision tree

Rajesh Kumar; S. Geetha

Journal Article, Open Access

Advances in Science, Technology and Engineering Systems (2020) 5(5) 536-549

DOI: 10.25046/AJ050566

38 Citations

68 Readers

Abstract

In the era of Industry 4.0 and digitalization, we depend increasingly on communication and on transactions of many kinds, such as financial transactions and the exchange of information by various means. These transactions need to be secure. Distinguishing benign software from malware is one way to secure them. In this work we propose a malware classification scheme that builds a model using low-end computing resources and a very large, balanced malware dataset. To the best of our knowledge, this is the first time the complete dataset has been used with the XGBoost GBDT machine learning technique to build a classifier on low-end computing resources. The model is optimized for efficiency by removing noisy features: the dataset's feature set is reduced using domain expertise in malware detection and the feature importance functionality of XGBoost, and the hyperparameters are tuned. With the reduced feature set, the model can be trained on low computation resources in 1315 seconds without affecting classification performance. Hyperparameter tuning further improves the model, which achieves a higher accuracy of 98.5% and an on-par AUC of 0.9989.
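
The feature-reduction step mentioned in the abstract can be illustrated with a small sketch. This is not the authors' code and it does not call the XGBoost library: it is a pure-Python stand-in that boosts depth-1 trees using the second-order split gain found in XGBoost-style boosting, sums the gain per feature as an importance score, and drops features whose share of the total gain falls below a cutoff. The synthetic data, the function names (`best_stump`, `fit`, `select_features`), the regularisation value, and the 10% cutoff are all assumptions made for illustration.

```python
import math
import random

LAMBDA = 1.0  # L2 regularisation on leaf weights (assumed value)

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def score_term(g, h):
    # Structure score of a leaf with gradient sum g and hessian sum h.
    return g * g / (h + LAMBDA)

def best_stump(X, grad, hess):
    """Return (gain, feature, threshold, left_weight, right_weight) of the best split."""
    n, d = len(X), len(X[0])
    G, H = sum(grad), sum(hess)
    best = (0.0, None, None, 0.0, 0.0)
    for j in range(d):
        order = sorted(range(n), key=lambda i: X[i][j])
        gl = hl = 0.0
        for k in range(n - 1):
            i, nxt = order[k], order[k + 1]
            gl += grad[i]
            hl += hess[i]
            if X[i][j] == X[nxt][j]:
                continue  # cannot split between equal values
            gr, hr = G - gl, H - hl
            gain = 0.5 * (score_term(gl, hl) + score_term(gr, hr) - score_term(G, H))
            if gain > best[0]:
                # Rows with value <= X[i][j] go left, the rest go right.
                best = (gain, j, X[i][j], -gl / (hl + LAMBDA), -gr / (hr + LAMBDA))
    return best

def fit(X, y, rounds=20, lr=0.3):
    """Boost stumps on logistic loss; return the stumps and per-feature total gain."""
    score = [0.0] * len(X)
    importance = [0.0] * len(X[0])
    stumps = []
    for _ in range(rounds):
        p = [sigmoid(s) for s in score]
        grad = [pi - yi for pi, yi in zip(p, y)]   # first derivative of log loss
        hess = [pi * (1.0 - pi) for pi in p]       # second derivative of log loss
        gain, j, thr, wl, wr = best_stump(X, grad, hess)
        if j is None:
            break  # no split improves the objective
        importance[j] += gain
        stumps.append((j, thr, lr * wl, lr * wr))
        for i, row in enumerate(X):
            score[i] += lr * wl if row[j] <= thr else lr * wr
    return stumps, importance

def predict(stumps, row):
    s = sum(wl if row[j] <= thr else wr for j, thr, wl, wr in stumps)
    return 1 if s > 0 else 0

def select_features(importance, min_share=0.10):
    """Keep features whose share of the total gain is at least min_share."""
    total = sum(importance)
    if total <= 0:
        return list(range(len(importance)))
    return [j for j, g in enumerate(importance) if g / total >= min_share]

# Synthetic stand-in data: only features 0 and 1 determine the label.
random.seed(0)
X = [[random.random() for _ in range(5)] for _ in range(200)]
y = [1 if row[0] + row[1] > 1.0 else 0 for row in X]

stumps, importance = fit(X, y)
kept = select_features(importance)

# Retrain on the reduced feature set and check training accuracy.
X_small = [[row[j] for j in kept] for row in X]
small_stumps, _ = fit(X_small, y)
accuracy = sum(predict(small_stumps, r) == t for r, t in zip(X_small, y)) / len(y)
print("kept features:", kept, "training accuracy:", round(accuracy, 3))
```

In practice the same pattern would be applied with a real gradient boosting library and a held-out test set; the cutoff and the boosting hyperparameters would themselves be tuned, as the abstract describes.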

Cite

Kumar, R., & Geetha, S. (2020). Malware classification using XGboost-Gradient boosted decision tree. Advances in Science, Technology and Engineering Systems, 5(5), 536–549. https://doi.org/10.25046/AJ050566
