Spam Detection Using Bidirectional Transformers and Machine Learning Classifier Algorithms

Yanhui Guo; Zelal Mustafaoglu; Deepika Koundal

Journal ArticleOPEN ACCESS

Spam Detection Using Bidirectional Transformers and Machine Learning Classifier Algorithms

Journal of Computational and Cognitive Engineering (2023) 2(1) 5-9

DOI: 10.47852/bonviewJCCE2202192

159Citations

53Readers

Abstract

Spam email has accounted for a high percentage of email traffic and has created problems worldwide. The deep learning transformer model is an efficient tool in natural language processing. This study proposed an efficient spam detection approach using a pretrained bidirectional encoder representation from transformer (BERT) and machine learning algorithms to classify ham or spam emails. Email texts were fed into the BERT, and features obtained from the BERT outputs were used to represent the texts. Four classifier algorithms in machine learning were employed to classify the features of the text into ham or spam categories. The proposed model was tested using two public datasets in the experiments. The results of the evaluation metrics demonstrate that the logistic regression algorithm achieved the best classification performance in both datasets. They also justified the efficient ability of the proposed model in detecting spam emails.

Author supplied keywords

Cite

CITATION STYLE

APA

Guo, Y., Mustafaoglu, Z., & Koundal, D. (2023). Spam Detection Using Bidirectional Transformers and Machine Learning Classifier Algorithms. Journal of Computational and Cognitive Engineering, 2(1), 5–9. https://doi.org/10.47852/bonviewJCCE2202192

Spam Detection Using Bidirectional Transformers and Machine Learning Classifier Algorithms

Abstract

Author supplied keywords

Cite

Register to see more suggestions