Content-based methodology for anomaly detection on the web

Mark Last; Bracha Shapira; Yuval Elovici; Omer Zaafrany; Abraham Kandel

Conference Proceedings

Content-based methodology for anomaly detection on the web

Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (2003) 2663 113-123

DOI: 10.1007/3-540-44831-4_13

18Citations

12Readers

Get full text

Abstract

As became apparent after the tragic events of September 11, 2001, terrorist organizations and other criminal groups are increasingly using the legitimate ways of Internet access to conduct their malicious activities. Such actions cannot be detected by existing intrusion detection systems that are generally aimed at protecting computer systems and networks from some kind of "cyber attacks". Preparation of an attack against the human society itself can only be detected through analysis of the content accessed by the users. The proposed study aims at developing an innovative methodology for abnormal activity detection, which uses web content as the audit information provided to the detection system. The new behavior-based detection method learns the normal behavior by applying an unsupervised clustering algorithm to the contents of publicly available web pages viewed by a group of similar users. In this paper, we represent page content by the well-known vector space model. The content models of normal behavior are used in real-time to reveal deviation from normal behavior at a specific location on the net. The detection algorithm sensitivity is controlled by a threshold parameter. The method is evaluated by the tradeoff between the detection rate (TP) and the false positive rate (FP).

Author supplied keywords

Cite

CITATION STYLE

APA

Last, M., Shapira, B., Elovici, Y., Zaafrany, O., & Kandel, A. (2003). Content-based methodology for anomaly detection on the web. In Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (Vol. 2663, pp. 113–123). Springer Verlag. https://doi.org/10.1007/3-540-44831-4_13

Content-based methodology for anomaly detection on the web

Abstract

Author supplied keywords

Cite

Register to see more suggestions