Anomaly detection in categorical datasets using Bayesian networks

Lida Rashidi; Sattar Hashemi; Ali Hamzeh

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 7003 LNAI, part 2, pp. 610-619 (2011)

DOI: 10.1007/978-3-642-23887-1_78

Abstract

In this paper we present a method for finding anomalous records in categorical or mixed datasets in an unsupervised fashion. Because the data in many problems consist of normal records with a small minority of anomalies, many approaches build a model from the training data and compare test records against it. Instead of building a model, we keep track of the number of occurrences of different attribute-value combinations. We also consider a more meaningful definition of anomalies and incorporate the Bayesian network structure into it. A score is defined for each test record: we combine the supports of different rules according to the Bayesian network structure in order to determine the label of the test instance. The results show that the proposed method achieves a higher or similar F-measure and precision compared to a Bayesian-network-based approach in all cases. © 2011 Springer-Verlag.
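The abstract does not give the scoring formula, so the following is only a rough sketch of the general idea of counting attribute-value combinations and combining rule supports along a Bayesian network structure. Everything specific here is an assumption made for illustration and is not taken from the paper: the function names `count_combinations` and `record_score`, the hand-specified `parents` structure, the Laplace smoothing constant `alpha`, the averaging of log supports, and the toy records.

```python
from collections import Counter
import math


def count_combinations(records, parents):
    """Count how often each attribute value occurs together with its parents' values.

    `parents` maps each attribute to a tuple of its parent attributes
    (an assumed, hand-specified Bayesian network structure).
    """
    family = Counter()   # (attr, value, parent values) -> occurrences
    context = Counter()  # (attr, parent values) -> occurrences
    domain = {attr: set() for attr in parents}
    for rec in records:
        for attr, pa in parents.items():
            pa_vals = tuple(rec[p] for p in pa)
            family[(attr, rec[attr], pa_vals)] += 1
            context[(attr, pa_vals)] += 1
            domain[attr].add(rec[attr])
    return family, context, domain


def record_score(rec, parents, family, context, domain, alpha=1.0):
    """Average log support of the rules "parent values -> attribute value".

    Lower scores mean the record's value combinations were rarely seen.
    Supports are Laplace-smoothed (illustrative choice) so unseen
    combinations get a small nonzero support instead of log(0).
    """
    total = 0.0
    for attr, pa in parents.items():
        pa_vals = tuple(rec[p] for p in pa)
        k = len(domain[attr] | {rec[attr]})  # number of values known for attr
        support = (family[(attr, rec[attr], pa_vals)] + alpha) / (
            context[(attr, pa_vals)] + alpha * k
        )
        total += math.log(support)
    return total / len(parents)


# Toy data (made up): "shell" depends on "os" in the assumed structure.
parents = {"os": (), "shell": ("os",)}
train = [{"os": "linux", "shell": "bash"}] * 10 + [
    {"os": "windows", "shell": "powershell"}
] * 10
family, context, domain = count_combinations(train, parents)

usual = {"os": "linux", "shell": "bash"}
unusual = {"os": "linux", "shell": "powershell"}
usual_score = record_score(usual, parents, family, context, domain)
unusual_score = record_score(unusual, parents, family, context, domain)
```

In this sketch a record would be labeled anomalous when its score falls below some threshold; how such a threshold is chosen, and how the paper actually combines supports, is not stated in the abstract.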

Cite

Rashidi, L., Hashemi, S., & Hamzeh, A. (2011). Anomaly detection in categorical datasets using Bayesian networks. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7003 LNAI, pp. 610–619). https://doi.org/10.1007/978-3-642-23887-1_78
