This paper examines the impact of data balancing algorithms in the network traffic classification problem on various types of distributed denial of service attacks on the CICDDoS2019 dataset, which contains information about reflection-based and exploitation-based attacks. The results of computational experiments have shown the effectiveness of data balancing algorithms such as naive random sampling, synthetic minority sampling, and adaptive synthetic sampling in identifying network attacks. A comparative analysis of various data sampling approaches has shown that the adaptive synthetic sampling method with the random forest algorithm demonstrates the highest classification accuracy.
CITATION STYLE
Bolodurina, I., Shukhman, A., Parfenov, D., Zhigalov, A., & Zabrodina, L. (2020). Investigation of the problem of classifying unbalanced datasets in identifying distributed denial of service attacks. In Journal of Physics: Conference Series (Vol. 1679). IOP Publishing Ltd. https://doi.org/10.1088/1742-6596/1679/4/042020
Mendeley helps you to discover research relevant for your work.