The effectiveness of sampling methods for the imbalanced network intrusion detection data set

Kok Chin Khor; Choo Yee Ting; Somnuk Phon-Amnuaisuk

Journal Article

The effectiveness of sampling methods for the imbalanced network intrusion detection data set

Advances in Intelligent Systems and Computing (2014) 287 613-622

DOI: 10.1007/978-3-319-07692-8_58

7Citations

9Readers

Get full text

Abstract

One of the countermeasures taken by security experts against network attacks is by implementing Intrusion Detection Systems (IDS) in computer networks. Researchers often utilize the de facto network intrusion detection data set, KDD Cup 1999, to evaluate proposed IDS in the context of data mining. However, the imbalanced class distribution of the data set leads to a rare class problem. The problem causes low detection (classification) rates for the rare classes, particularly R2L and U2R. Two commonly used sampling methods to mitigate the rare class problem were evaluated in this research, namely, (1) under-sampling and (2) over-sampling. However, these two methods were less effective in mitigating the problem. The reasons of such performance are presented in this paper.

Author supplied keywords

Cite

CITATION STYLE

APA

Khor, K. C., Ting, C. Y., & Phon-Amnuaisuk, S. (2014). The effectiveness of sampling methods for the imbalanced network intrusion detection data set. Advances in Intelligent Systems and Computing, 287, 613–622. https://doi.org/10.1007/978-3-319-07692-8_58

The effectiveness of sampling methods for the imbalanced network intrusion detection data set

Abstract

Author supplied keywords

Cite

Register to see more suggestions