Domain Generation Algorithm Detection Utilizing Model Hardening Through GAN-Generated Adversarial Examples

Nathaniel Gould; Taishi Nishiyama; Kazunori Kamiya

Conference Proceedings

Domain Generation Algorithm Detection Utilizing Model Hardening Through GAN-Generated Adversarial Examples

Communications in Computer and Information Science (2020) 1271 CCIS 84-101

DOI: 10.1007/978-3-030-59621-7_5

1Citations

5Readers

Get full text

Abstract

Modern malware families often utilize Domain Generation Algorithms (DGAs) to register addresses for their Command and Control (C&C) servers. Instead of hardcoding the address of the C&C domain in the malware, DGAs are used to frequently change the address of the C&C server, causing static detection methods, such as blacklists, to be ineffective. In response, DGA detection methods have been proposed which attempt to detect these DGA-produced domains in live traffic. Previous research has investigated using domains generated from a Generative Adversarial Network (GAN) to increase the ability of a detection model to detect unseen DGA variants. Building upon this concept, we test a similar experiment using an improved GAN and detection model. For the GAN, we train a Gradient Penalty Wasserstein GAN using benign domains as an input to produce set generated domains that are difficult to differentiate from real domains. The resulting set of domains have characteristics, such as character distribution, that more closely resemble real domains than sets produced in previous research. We then use these GAN-produced domains as additional examples of DGA domains and use them to augment the training set for a DGA detection model. While a feature engineering approach has been used in previous research, we use a deep learning, convolutional neural network and long short-term memory based detection model which had significantly higher hold-out detection rates for many DGA families. After training, we evaluate the model by comparing its detection rate on several holdout DGA families with GAN augmentation compared to the same model which used an augmented training set. This is shown to increase the detection rate of the classifier (at a standardized false positive rate) on certain DGA families. Further, unlike previous approaches, we conduct significance testing on the resulting detection rates to more accurately show the effect that adversarial hardening had on the model.

Author supplied keywords

Cite

CITATION STYLE

APA

Gould, N., Nishiyama, T., & Kamiya, K. (2020). Domain Generation Algorithm Detection Utilizing Model Hardening Through GAN-Generated Adversarial Examples. In Communications in Computer and Information Science (Vol. 1271 CCIS, pp. 84–101). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-59621-7_5

Domain Generation Algorithm Detection Utilizing Model Hardening Through GAN-Generated Adversarial Examples

Abstract

Author supplied keywords

Cite

Register to see more suggestions