Feature extraction and malware detection on large HTTPS data using mapreduce

Přemysl Čech; Jan Kohout; Jakub Lokoč; Tomáš Komárek; Jakub Maroušek; Tomáš Pevný

Conference Proceedings

Feature extraction and malware detection on large HTTPS data using mapreduce

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9939 LNCS 311-324

DOI: 10.1007/978-3-319-46759-7_24

10Citations

5Readers

Get full text

Abstract

Secure HTTP network traffic represents a challenging immense data source for machine learning tasks. The tasks usually try to learn and identify infected network nodes, given only limited traffic features available for secure HTTP data. In this paper, we investigate the performance of grid histograms that can be used to aggregate traffic features of network nodes considering just 5-min batches for snapshots. We compare the representation using linear and k-NN classifiers. We also demonstrate that all presented feature extraction and classification tasks can be implemented in a scalable way using the MapReduce approach.

Author supplied keywords

Cite

CITATION STYLE

APA

Čech, P., Kohout, J., Lokoč, J., Komárek, T., Maroušek, J., & Pevný, T. (2016). Feature extraction and malware detection on large HTTPS data using mapreduce. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9939 LNCS, pp. 311–324). Springer Verlag. https://doi.org/10.1007/978-3-319-46759-7_24

Feature extraction and malware detection on large HTTPS data using mapreduce

Abstract

Author supplied keywords

Cite

Register to see more suggestions