This research was conducted based on a report from the United States Federal Trade Commission regarding fraud through electronic text messages via SMS that fraudsters use to manipulate potential victims. Usually, scammers spread SMS spam as an intermediary for the crime. The development of a supervised learning algorithm is applied to predict SMS spam into three categories, such as SMS spam, SMS fraud, and promotional SMS. The prediction system is dividing into several stages in the development process, including data labelling, data preprocessing, modelling, and model validation. The known accuracy based on modelling using Logistic Regression using a test size of 15% is 99%, using a test size of 20% is 99%, and using a test size of 25% is 98%. The Multinomial Naïve Bayes algorithm's accuracy with a test size of 15%, 20%, 25% is 97%. So, the SMS spam prediction approach uses the logistic regression method, which has the highest accuracy.
CITATION STYLE
Raharja, P. A., Sidiq, M. F., & Fransisca, D. C. (2022). Comparative Analysis of Multinomial Naïve Bayes and Logistic Regression Models for Prediction of SMS Spam. JURNAL MEDIA INFORMATIKA BUDIDARMA, 6(3), 1290. https://doi.org/10.30865/mib.v6i3.4019
Mendeley helps you to discover research relevant for your work.