Unsupervised ensemble learning for mining top-n outliers

Jun Gao; Weiming Hu; Zhongfei Zhang; Ou Wu

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2012), Vol. 7301 LNAI (Part 1), pp. 418-430

DOI: 10.1007/978-3-642-30217-6_35

4 Citations

7 Readers

Abstract

Outlier detection is an important and attractive problem in knowledge discovery in large datasets. Instead of deciding whether a single object is an outlier, we study detecting the n most outstanding outliers, i.e., top-n outlier detection. Further, we consider the problem of combining the top-n outlier lists produced by various individual detection methods. A general framework for ensemble learning in top-n outlier detection is proposed based on rank aggregation techniques. Two approaches are proposed to accommodate the differing scales and characteristics of outlier scores from different detection methods: a score-based aggregation approach that normalizes outlier scores, and an order-based aggregation approach based on the distance-based Mallows model. Extensive experiments on several real datasets demonstrate that the proposed approaches consistently deliver stable and effective performance across different datasets, with good scalability, in comparison with the state-of-the-art literature. © 2012 Springer-Verlag.
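
The two aggregation styles named in the abstract can be illustrated with a minimal sketch. This is not the authors' method: min-max scaling stands in for the paper's score normalization, and a simple Borda count stands in for the order-based approach (the paper uses a distance-based Mallows model, which is not reproduced here). The function names `min_max_normalize`, `score_based_top_n`, and `borda_top_n` are hypothetical and chosen only for this illustration.

```python
from typing import Dict, List


def min_max_normalize(scores: Dict[str, float]) -> Dict[str, float]:
    """Rescale one detector's outlier scores to [0, 1].

    Assumption: min-max scaling is a stand-in for the paper's
    normalization method, which the abstract does not specify.
    """
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        # All scores equal: no object stands out for this detector.
        return {obj: 0.0 for obj in scores}
    return {obj: (s - lo) / (hi - lo) for obj, s in scores.items()}


def score_based_top_n(detector_scores: List[Dict[str, float]], n: int) -> List[str]:
    """Score-based aggregation: sum normalized scores, return the n largest."""
    totals: Dict[str, float] = {}
    for scores in detector_scores:
        for obj, s in min_max_normalize(scores).items():
            totals[obj] = totals.get(obj, 0.0) + s
    # Sort by descending total; break ties by object id for determinism.
    ranked = sorted(totals, key=lambda obj: (-totals[obj], obj))
    return ranked[:n]


def borda_top_n(rankings: List[List[str]], n: int) -> List[str]:
    """Order-based aggregation using a Borda count (NOT the Mallows model).

    Each ranking lists objects from most to least outlying; an object at
    position p in a list of length m receives m - p points.
    """
    points: Dict[str, int] = {}
    for ranking in rankings:
        m = len(ranking)
        for pos, obj in enumerate(ranking):
            points[obj] = points.get(obj, 0) + (m - pos)
    ranked = sorted(points, key=lambda obj: (-points[obj], obj))
    return ranked[:n]


# Two hypothetical detectors whose scores live on very different scales.
detector_a = {"a": 10.0, "b": 2.0, "c": 1.0, "d": 0.0}
detector_b = {"a": 0.9, "b": 0.1, "c": 0.8, "d": 0.0}
top_by_score = score_based_top_n([detector_a, detector_b], n=2)

# Three hypothetical rank lists that carry order information only.
lists = [["a", "b", "c"], ["a", "c", "b"], ["c", "a", "b"]]
top_by_order = borda_top_n(lists, n=2)
```

Normalizing before summing keeps a detector with a large score range from dominating the combined list, while the order-based variant ignores score magnitudes entirely and so applies when only ranked lists are available.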

Cite

Gao, J., Hu, W., Zhang, Z., & Wu, O. (2012). Unsupervised ensemble learning for mining top-n outliers. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7301 LNAI, pp. 418–430). https://doi.org/10.1007/978-3-642-30217-6_35
