Helix: DGA Domain Embeddings for Tracking and Exploring Botnets

Lior Sidi; Yisroel Mirsky; Asaf Nadler; Yuval Elovici; Asaf Shabtai

Conference ProceedingsOPEN ACCESS

Helix: DGA Domain Embeddings for Tracking and Exploring Botnets

International Conference on Information and Knowledge Management, Proceedings (2020) 2741-2748

DOI: 10.1145/3340531.3416022

8Citations

11Readers

Get full text

Abstract

Botnets have been using domain generation algorithms (DGA) for over a decade to covertly and robustly identify the domain name of their command and control servers (C&C). Recent advancements in DGA detection has motivated botnet owners to rapidly alter the C&C domain and use adversarial techniques to evade detection. As a result, it has become increasingly difficult to track botnets in DNS traffic. In this paper, we present Helix, a method for tracking and exploring botnets. Helix uses a spatio-temporal deep neural network autoencoder to convert domains into numerical vectors (embeddings) which capture the DGA and seed used to create the domain. This is made possible by leveraging both convolutional (spatial) and recurrent (temporal) layers, and by using techniques such as attention mechanisms and highways. Furthermore, by using an autoencoder architecture, the network can be trained in an unsupervised manner (no labeling of data) which makes the system practical for real world deployments. In our evaluation, we found that Helix can track botnet campaigns, distinguish between DGA families and seeds, and can identify domains generated using the latest adversarial machine learning techniques. Helix is currently being used to track botnets in one of the world's largest Internet Service Providers (ISP), and we include some of the ISP's analysis work using our method.

Author supplied keywords

Cite

CITATION STYLE

APA

Sidi, L., Mirsky, Y., Nadler, A., Elovici, Y., & Shabtai, A. (2020). Helix: DGA Domain Embeddings for Tracking and Exploring Botnets. In International Conference on Information and Knowledge Management, Proceedings (pp. 2741–2748). Association for Computing Machinery. https://doi.org/10.1145/3340531.3416022

Helix: DGA Domain Embeddings for Tracking and Exploring Botnets

Abstract

Author supplied keywords

Cite

Register to see more suggestions