In this paper we address the task of detecting URLs that point to infected web pages, using machine learning with features drawn from natural language processing. We show that these features outperform the previously used hand-crafted lexical features and yield results comparable to those of the more expensive host-based features. We also introduce a new, related task, that of identifying URLs that point to downloads of portable executable files, and show that our models perform very well on this task too.
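The abstract does not say which NLP features are used, so the following is only an illustrative sketch of one common choice: treating a URL as plain text and counting its overlapping character n-grams. The function name `char_ngrams`, the default n-gram length of 3, and the example URL are assumptions made for illustration and are not taken from the paper.

```python
from collections import Counter


def char_ngrams(url, n=3):
    """Count overlapping character n-grams in a lowercased URL.

    Hypothetical helper: the n-gram length and the lowercasing step
    are illustrative choices, not the paper's documented settings.
    """
    text = url.lower()
    # Slide a window of width n over the string; a string shorter
    # than n produces no n-grams and yields an empty Counter.
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


if __name__ == "__main__":
    # Made-up URL, used only to show the shape of the feature vector.
    features = char_ngrams("http://example.com/setup.exe")
    for gram, count in features.most_common(5):
        print(repr(gram), count)
```

Counts of this kind can be passed to any standard classifier as a sparse feature vector. Unlike hand-crafted lexical features, they need no manually chosen token lists, and unlike host-based features, they need no network lookups.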
CITATION STYLE
Şulea, O. M., Dinu, L. P., & Peşte, A. (2015). Using NLP specific tools for non-NLP specific tasks. A web security application. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9492, pp. 631–638). Springer Verlag. https://doi.org/10.1007/978-3-319-26561-2_74