Systematic construction of anomaly detection benchmarks from real data

Andrew F. Emmott; Shubhomoy Das; Thomas Dietterich; Alan Fern; Weng Keen Wong

Conference Proceedings

Systematic construction of anomaly detection benchmarks from real data

Proceedings of the ACM SIGKDD Workshop on Outlier Detection and Description, ODD 2013 (2013) 16-21

DOI: 10.1145/2500853.2500858

108Citations

165Readers

Get full text

Abstract

Research in anomaly detection suffers from a lack of realistic and publicly-available problem sets. This paper discusses what properties such problem sets should possess. It then introduces a methodology for transforming existing classification data sets into ground-truthed benchmark data sets for anomaly detection. The methodology produces data sets that vary along three important dimensions: (a) point diffculty, (b) relative frequency of anomalies, and (c) clusteredness. We apply our generated datasets to benchmark several popular anomaly detection algorithms under a range of different conditions.

Cite

CITATION STYLE

APA

Emmott, A. F., Das, S., Dietterich, T., Fern, A., & Wong, W. K. (2013). Systematic construction of anomaly detection benchmarks from real data. In Proceedings of the ACM SIGKDD Workshop on Outlier Detection and Description, ODD 2013 (pp. 16–21). https://doi.org/10.1145/2500853.2500858

Systematic construction of anomaly detection benchmarks from real data

Abstract

Cite

Register to see more suggestions