Multimodal malware classification using proposed ensemble deep neural network framework

Sadia Nazim; Muhammad Mansoor Alam; Safdar Rizvi; Jawahir Che Mustapha; Syed Shujaa Hussain; Mazliham Mohd Su’ud

Journal ArticleOPEN ACCESS

Multimodal malware classification using proposed ensemble deep neural network framework

Scientific Reports (2025) 15(1)

DOI: 10.1038/s41598-025-96203-3

15Citations

95Readers

Abstract

In the contemporary technological world, fortifying cybersecurity defense against dynamic threat landscapes is imperative. Malware detectors play a critical role in this endeavor, utilizing various techniques such as statistical analysis, static and dynamic analysis, and machine learning (ML) to compare signatures and identify threats. Deep learning (DL) aids in accurately classifying complex malware features. The cross-domain research in data fusion strives to integrate information from multiple sources to augment reliability and minimize errors in detecting sophisticated cyber threats. This collaborative approach is the least addressed and pivotal for protecting against the advancing environment of modern malware attacks. This study presents a state-of-the-art malware analysis framework that employs a multimodal approach by integrating malware images and numeric features for effective malware classification. The experiments are performed sequentially, encompassing data preprocessing, feature selection using Neighbourhood Component Analysis (NCA), and dataset balancing with Synthetic Minority Over-sampling Technique (SMOTE). Subsequently, the late fusion technique is utilized for multimodal classification by employing Random Under Sampling and Boosting (RUSBoost) and the proposed ensemble deep neural network. The RUSBoost technique involves random undersampling and adaptive boosting to moderate bias toward majority classes while improving minority class (malware) detection. Multimodal Late fusion experimental results (95.36%) of RUSBoost (numeric) and the proposed model (imagery) outperform the standalone prevailing results for imagery (95.02%) and numeric (93.36%) data. The effectiveness of the proposed model is verified through the evaluation metrics such as Recall (86.5%), F1-score (85.0%), and Precision (79.9%). The multimodal late fusion of numeric and visual data makes the model more robust in detecting diverse malware variants. The experimental outcomes demonstrate that multimodal analysis may efficiently increase the identification strength and accuracy, especially when majority vote and bagging are employed for late fusion.

Author supplied keywords

Cite

CITATION STYLE

APA

Nazim, S., Alam, M. M., Rizvi, S., Mustapha, J. C., Hussain, S. S., & Su’ud, M. M. (2025). Multimodal malware classification using proposed ensemble deep neural network framework. Scientific Reports, 15(1). https://doi.org/10.1038/s41598-025-96203-3

Multimodal malware classification using proposed ensemble deep neural network framework

Abstract

Author supplied keywords

Cite

Register to see more suggestions