Hierarchically clustered LSH for hierarchical outliers detection

Konstantinos Georgoulas; Yannis Kotidis

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 9827 LNCS, pp. 169–184 (2016)

DOI: 10.1007/978-3-319-44403-1_11

Abstract

In this work we introduce hierarchical outliers, which extend the notion of distance-based outliers to hierarchical data domains. We present a novel framework that detects hierarchical outliers in a consistent manner by providing a desired monotonicity property: a data observation that finds enough support to be disregarded as an outlier at one level of the hierarchy will not be labelled as an outlier when examined at a coarser-grained level above. This enables users to grade how suspicious a data observation is by the number of hierarchical levels at which it is found to be an outlier. Our technique uses an innovative locality-sensitive hashing (LSH) indexing scheme in which data points sharing the same hash value are clustered. The framework's index maintains the computed centroids, while the detailed data descriptors are discarded. Compared with a straightforward LSH index, this reduces storage space, execution time, and the number of distance evaluations.
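The idea of keeping only one centroid per hash value can be illustrated with a minimal sketch. It is not the authors' implementation: the random-projection hash family, the function names (`make_hash`, `build_centroid_index`, `approx_support`, `is_outlier`), and the parameters `radius` and `min_support` are all assumptions chosen for illustration, and the hierarchy of levels is left out. For brevity the support estimate scans every stored centroid rather than probing only the buckets near the query.

```python
import math
import random


def make_hash(dim, n_proj, width, rng):
    """Build one LSH function from Gaussian random projections (an assumed choice).

    Each projection maps x to floor((a . x + b) / width); the tuple of
    these bucket ids is the hash value of x.
    """
    planes = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_proj)]
    offsets = [rng.uniform(0.0, width) for _ in range(n_proj)]

    def h(x):
        return tuple(
            math.floor((sum(a * v for a, v in zip(plane, x)) + b) / width)
            for plane, b in zip(planes, offsets)
        )

    return h


def build_centroid_index(points, h):
    """Cluster points that share a hash value; keep only (centroid, count)."""
    sums, counts = {}, {}
    for p in points:
        key = h(p)
        acc = sums.setdefault(key, [0.0] * len(p))
        for i, v in enumerate(p):
            acc[i] += v
        counts[key] = counts.get(key, 0) + 1
    # The raw points are not stored: each bucket is summarized by its mean.
    return {k: ([s / counts[k] for s in sums[k]], counts[k]) for k in sums}


def approx_support(q, index, radius):
    """Estimate how many points lie within `radius` of q using centroids only."""
    return sum(n for centroid, n in index.values() if math.dist(q, centroid) <= radius)


def is_outlier(q, index, radius, min_support):
    """Distance-based test: q is an outlier if its estimated support is too low."""
    return approx_support(q, index, radius) < min_support


# Hypothetical data: a dense 7 x 7 grid of points near the origin.
rng = random.Random(0)
points = [(i * 0.01, j * 0.01) for i in range(7) for j in range(7)]
index = build_centroid_index(points, make_hash(dim=2, n_proj=2, width=1.0, rng=rng))

print(is_outlier((0.03, 0.03), index, radius=1.0, min_support=5))  # inside the dense region
print(is_outlier((10.0, 10.0), index, radius=1.0, min_support=5))  # far from every centroid
```

Because each bucket stores one centroid and a count instead of its members, the number of distance evaluations per query is bounded by the number of occupied buckets, at the cost of an approximate support estimate.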

Cite

Georgoulas, K., & Kotidis, Y. (2016). Hierarchically clustered LSH for hierarchical outliers detection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9827 LNCS, pp. 169–184). Springer Verlag. https://doi.org/10.1007/978-3-319-44403-1_11
