An efficient histogram method for outlier detection

Matthew Gebski; Raymond K. Wong

Conference Proceedings

An efficient histogram method for outlier detection

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2007) 4443 LNCS 176-187

DOI: 10.1007/978-3-540-71703-4_17

11Citations

11Readers

Get full text

Abstract

An important problem in database and data mining systems is the detection of outlying points. It is often the case that data observations exhibiting atypical properties are of more interest than those fitting common patterns. While anomaly and outlier detection have received considerable attention from the statistics community, these approaches are primarily focused on analysis of data sets containing relatively few and univariate observations. Recently, valuable approaches have been proposed to facilitate multidimensional analysis for larger data sets. Unfortunately, these approaches are often expensive and require numerous comparisons between each point and the remainder of the data. We propose an approach using histograms for outlier detection. Sparse regions of the data are recognised and used for identifying points that are likely to be outliers. An extensive experimental evaluation demonstrates the efficiency of our approach under a number of circumstances with varying parameters on real world and synthetic data sets. © Springer-Verlag Berlin Heidelberg 2007.

Cite

CITATION STYLE

APA

Gebski, M., & Wong, R. K. (2007). An efficient histogram method for outlier detection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4443 LNCS, pp. 176–187). Springer Verlag. https://doi.org/10.1007/978-3-540-71703-4_17

An efficient histogram method for outlier detection

Abstract

Cite

Register to see more suggestions