Comparative Analysis of Multinomial Naïve Bayes and Logistic Regression Models for Prediction of SMS Spam

  • Raharja P
  • Sidiq M
  • Fransisca D

Abstract

This research was motivated by a report from the United States Federal Trade Commission on fraud carried out through SMS text messages, which fraudsters use to manipulate potential victims. Scammers typically spread SMS spam as a vehicle for the crime. Supervised learning algorithms are applied to classify messages into three categories: SMS spam, SMS fraud, and promotional SMS. The prediction system is developed in several stages: data labelling, data preprocessing, modelling, and model validation. Logistic Regression reached an accuracy of 99% with a test size of 15%, 99% with a test size of 20%, and 98% with a test size of 25%. Multinomial Naïve Bayes reached an accuracy of 97% at each of the three test sizes (15%, 20%, and 25%). Logistic Regression therefore has the highest accuracy and is the method adopted for the SMS spam prediction approach.
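To make one of the two compared models concrete, the following is a minimal pure-Python sketch of a Multinomial Naïve Bayes text classifier with Laplace smoothing. It is not the authors' implementation: the abstract does not describe their preprocessing, features, or tooling, so the whitespace tokenizer, the smoothing value `alpha=1.0`, the label names, and the six training messages are all hypothetical and chosen only for illustration. The Logistic Regression model and the 15%, 20%, and 25% test splits are not reproduced here.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately simple, assumed tokenizer)."""
    return text.lower().split()


def train_multinomial_nb(messages, labels, alpha=1.0):
    """Estimate log priors and Laplace-smoothed log likelihoods for each class."""
    class_counts = Counter(labels)
    word_counts = defaultdict(Counter)  # class label -> word frequency table
    vocab = set()
    for text, label in zip(messages, labels):
        tokens = tokenize(text)
        word_counts[label].update(tokens)
        vocab.update(tokens)

    total = len(labels)
    log_prior = {c: math.log(n / total) for c, n in class_counts.items()}

    log_likelihood = {}
    for c in class_counts:
        # Smoothed denominator: words seen in class c plus alpha per vocabulary entry.
        denom = sum(word_counts[c].values()) + alpha * len(vocab)
        log_likelihood[c] = {
            w: math.log((word_counts[c][w] + alpha) / denom) for w in vocab
        }
    return log_prior, log_likelihood, vocab


def predict(model, text):
    """Return the class with the highest posterior log score for one message."""
    log_prior, log_likelihood, vocab = model
    scores = {}
    for c in log_prior:
        # Words never seen during training are skipped.
        scores[c] = log_prior[c] + sum(
            log_likelihood[c][t] for t in tokenize(text) if t in vocab
        )
    return max(scores, key=scores.get)


def accuracy(model, messages, labels):
    """Fraction of messages whose predicted label matches the true label."""
    correct = sum(predict(model, m) == y for m, y in zip(messages, labels))
    return correct / len(labels)


# Hypothetical toy data covering the three categories named in the abstract.
train_messages = [
    "win a free prize now click this link",
    "free ringtone click link to claim",
    "your account is blocked send your pin to verify",
    "urgent transfer money to this account to unlock your card",
    "discount 50 percent on data package today",
    "buy data package and get bonus quota",
]
train_labels = ["spam", "spam", "fraud", "fraud", "promo", "promo"]

model = train_multinomial_nb(train_messages, train_labels)
print(predict(model, "claim your free prize"))
```

In a comparison like the one reported, `accuracy` would be computed on a held-out portion of the labelled data for each test size, and the same split would be scored with the Logistic Regression model.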

Citation (APA)

Raharja, P. A., Sidiq, M. F., & Fransisca, D. C. (2022). Comparative Analysis of Multinomial Naïve Bayes and Logistic Regression Models for Prediction of SMS Spam. JURNAL MEDIA INFORMATIKA BUDIDARMA, 6(3), 1290. https://doi.org/10.30865/mib.v6i3.4019
