Context. The urgent task for feature informativeness evaluation of a large amount of data has been solved. The object of the study was a network traffic. Objective is to analyze the data informativeness for network traffic anomalies detection in order to reduce the feature space. Method. The approach for feature informativeness evaluation of a large amount of data is proposed to increase the accuracy of the anomaly detection in network traffic. It also substantially increases the computation speed of the classification algorithms. The characteristics of a random forest and Firefly algorithms are considered. In the paper, an algorithm for feature selection based on the integration of these algorithms is proposed. Features are sorted in descending order according to their importance, the least informative ones are not considered. The decision trees, naive Bayes, Bayesian classifier, additive logistic regression and k-nearest neighbors method are considered as classifiers. The quality of the classification results is estimated using six evaluation metrics: true positive rate, false positive rate, precision, recall, Fmeasure and AUC. Results. The experiments have been performed in the Matlab environment ( 2016a) on the NSL-KDD data set, using the proposed algorithm. The best classification results for the selected features have been obtained using k-nearest neighbors method. Conclusions. The conducted experiments have confirmed the efficiency of the proposed approach and allow recommending it for practical use in feature informativeness evaluation in order to reduce the feature space and increase the computation speed of the classification algorithms. In addition, in order to further study the effectiveness of anomaly detection in network traffic, a real data set will be used.
CITATION STYLE
Imamverdiyev, Y. N., & Sukhostat, L. V. (2017). NETWORK TRAFFIC ANOMALIES DETECTION BASED ON INFORMATIVE FEATURES. Radio Electronics, Computer Science, Control, 0(3), 113–120. https://doi.org/10.15588/1607-3274-2017-3-13
Mendeley helps you to discover research relevant for your work.