Malicious domain detection based on k-means and smote

Qing Wang; Linyu Li; Bo Jiang; Zhigang Lu; Junrong Liu; Shijie Jian

Conference ProceedingsOPEN ACCESS

Malicious domain detection based on k-means and smote

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2020) 12138 LNCS 468-481

DOI: 10.1007/978-3-030-50417-5_35

13Citations

20Readers

Abstract

The Domain Name System (DNS) as the foundation of Internet, has been widely used by cybercriminals. A lot of malicious domain detection methods have received significant success in the past decades. However, existing detection methods usually use classification-based and association-based representations, which are not capable of dealing with the imbalanced problem between malicious and benign domains. To solve the problem, we propose a novel domain detection system named KSDom. KSDom designs a data collector to collect a large number of DNS traffic data and rich external DNS-related data, then employs K-means and SMOTE method to handle the imbalanced data. Finally, KSDom uses Categorical Boosting (CatBoost) algorithm to identify malicious domains. Comprehensive experimental results clearly show the effectiveness of our KSDom system and prove its good robustness in imbalanced datasets with different ratios. KSDom still has high accuracy even in extremely imbalanced DNS traffic.

Author supplied keywords

Cite

CITATION STYLE

APA

Wang, Q., Li, L., Jiang, B., Lu, Z., Liu, J., & Jian, S. (2020). Malicious domain detection based on k-means and smote. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12138 LNCS, pp. 468–481). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-50417-5_35

Malicious domain detection based on k-means and smote

Abstract

Author supplied keywords

Cite

Register to see more suggestions