Phishing Website Detection With Semantic Features Based on Machine Learning Classifiers: A Comparative Study

Ammar Almomani; Mohammad Alauthman; Mohd Taib Shatnawi; Mohammed Alweshah; Ayat Alrosan; Waleed Alomoush; Brij B. Gupta

Journal Article, Open Access

International Journal on Semantic Web and Information Systems (2022) 18(1)

DOI: 10.4018/IJSWIS.297032

107 Citations

94 Readers

Abstract

Phishing, including web phishing and spear phishing, is one of the main cybersecurity threats, and phishing websites remain a persistent problem. One of the main contributions of this study is the extraction of URL and domain identity features, abnormal features, HTML and JavaScript features, and domain features as semantic features for detecting phishing websites, which makes classification based on those features more controllable and more effective. The study applies machine learning algorithms to detect phishing websites and compares their results. The authors used 16 machine learning models with 10 semantic features, extracted from two datasets, that represent the most effective features for detecting phishing webpages. In the comparison, GradientBoostingClassifier and RandomForestClassifier achieved the best accuracy (about 97%). In contrast, GaussianNB and the stochastic gradient descent (SGD) classifier had the lowest accuracy among the classifiers, at 84% and 81% respectively.
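To make the idea of URL and domain identity features more concrete, the following is a minimal sketch of how a few such features might be computed from a raw URL using only the Python standard library. The function name `url_features` and the six features it returns are illustrative assumptions; the abstract does not list the 10 features the authors actually selected, so this should not be read as their feature set or their implementation.

```python
import ipaddress
from urllib.parse import urlparse


def url_features(url: str) -> dict:
    """Compute a few illustrative URL/domain features.

    Hypothetical feature set chosen for illustration only; it is not
    the 10-feature set used in the paper.
    """
    parsed = urlparse(url)
    host = parsed.hostname or ""

    # Check whether the host is a literal IP address instead of a domain name.
    try:
        ipaddress.ip_address(host)
        host_is_ip = True
    except ValueError:
        host_is_ip = False

    return {
        # Total number of characters in the URL.
        "url_length": len(url),
        # An "@" anywhere in the URL can hide the real destination host.
        "has_at_symbol": "@" in url,
        "host_is_ip": host_is_ip,
        # Dots beyond the one separating domain and TLD (0 for IP hosts).
        "num_subdomain_dots": 0 if host_is_ip else max(host.count(".") - 1, 0),
        # Hyphens in the host are sometimes used to imitate brand names.
        "has_hyphen_in_host": "-" in host,
        "uses_https": parsed.scheme == "https",
    }


if __name__ == "__main__":
    for example in (
        "https://www.example.com/",
        "http://192.168.0.1/login",
        "http://user@evil-site.example.com/x",
    ):
        print(example, url_features(example))
```

A feature dictionary like this could be converted to a numeric vector and passed to any of the classifiers named in the abstract; which features are most informative is an empirical question that the study addresses through its own comparison.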

Almomani, A., Alauthman, M., Shatnawi, M. T., Alweshah, M., Alrosan, A., Alomoush, W., & Gupta, B. B. (2022). Phishing Website Detection With Semantic Features Based on Machine Learning Classifiers: A Comparative Study. International Journal on Semantic Web and Information Systems, 18(1). https://doi.org/10.4018/IJSWIS.297032
