Deep character-level anomaly detection based on a convolutional autoencoder for zero-day phishing URL detection

Seok Jun Bu; Sung Bae Cho

Journal Article, Open Access

Electronics (Switzerland) (2021) 10(12)

DOI: 10.3390/electronics10121492

36 Citations

76 Readers

Abstract

Given the severity of phishing attacks, data-driven detection based on large-scale URL observations has proven effective in cyber security. However, supervised learning that relies on known attacks is not robust against zero-day phishing attacks. It is also known that phishing detection depends critically on fully exploiting the sequential features of URL characters. To address both points while ensuring sustainability and intelligibility, we propose combining a convolution operation, which models character-level URL features, with a deep convolutional autoencoder (CAE), which accounts for the nature of zero-day attacks. In extensive experiments on three real-world datasets comprising 222,541 URLs, the proposed method achieved the highest performance among the latest deep-learning methods. We demonstrated its superiority through receiver-operating characteristic (ROC) curve analysis and 10-fold cross-validation, and confirmed that sensitivity improved by 3.98% compared with the latest deep model.
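
The abstract does not give implementation details, so the following is only a minimal sketch of the steps around the autoencoder, not the authors' method. It assumes URLs are mapped to fixed-length sequences of character indices, that some trained autoencoder (not shown) yields one reconstruction error per URL, and that a URL is flagged as phishing when its error exceeds a threshold. The alphabet, `MAX_LEN`, and every function name here are hypothetical choices made for illustration.

```python
import string

# Assumed character vocabulary; the paper's actual alphabet is not stated in the abstract.
ALPHABET = string.ascii_lowercase + string.digits + "-._~:/?#[]@!$&'()*+,;=%"
# Index 0 is reserved for both padding and unknown characters (an assumption).
CHAR_TO_INDEX = {ch: i + 1 for i, ch in enumerate(ALPHABET)}
MAX_LEN = 64  # hypothetical fixed input length


def encode_url(url, max_len=MAX_LEN):
    """Map a URL to a fixed-length list of character indices (truncated, zero-padded)."""
    ids = [CHAR_TO_INDEX.get(ch, 0) for ch in url.lower()[:max_len]]
    return ids + [0] * (max_len - len(ids))


def flag_anomalies(errors, threshold):
    """Flag a URL as anomalous when its reconstruction error exceeds the threshold."""
    return [err > threshold for err in errors]


def sensitivity(labels, flags):
    """True positive rate: share of phishing URLs (label 1) that were flagged."""
    tp = sum(1 for y, f in zip(labels, flags) if y == 1 and f)
    fn = sum(1 for y, f in zip(labels, flags) if y == 1 and not f)
    return tp / (tp + fn) if (tp + fn) else 0.0


if __name__ == "__main__":
    # Made-up reconstruction errors standing in for the output of a trained autoencoder.
    errors = [0.05, 0.40, 0.10, 0.55]
    labels = [0, 1, 0, 1]  # 1 = phishing, 0 = benign
    flags = flag_anomalies(errors, threshold=0.3)
    print(encode_url("a.b", max_len=5))
    print(flags, sensitivity(labels, flags))
```

Sweeping the threshold over the observed errors and recording sensitivity against the false positive rate at each value would trace the kind of ROC curve the abstract refers to.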

Cite

Bu, S. J., & Cho, S. B. (2021). Deep character-level anomaly detection based on a convolutional autoencoder for zero-day phishing URL detection. Electronics (Switzerland), 10(12). https://doi.org/10.3390/electronics10121492
