Abstract
System instance clustering is crucial for large-scale Web services because it can significantly reduce the training overhead of anomaly detection methods. However, the vast number of system instances with massive time points, redundant metrics, and noise bring significant challenges. We propose OmniCluster to accurately and efficiently cluster system instances for large-scale Web services. It combines a one-dimensional convolutional autoencoder (1D-CAE), which extracts the main features of system instances, with a simple, novel, yet effective three-step feature selection strategy. We evaluated OmniCluster using real-world data collected from a top-tier content service provider providing services for one billion+ monthly active users (MAU), proving that OmniCluster achieves high accuracy (NMI=0.9160) and reduces the training overhead of five anomaly detection models by 95.01% on average.
Author supplied keywords
Cite
CITATION STYLE
Zhang, S., Li, D., Zhong, Z., Zhu, J., Liang, M., Luo, J., … Liu, Y. (2022). Robust System Instance Clustering for Large-Scale Web Services. In WWW 2022 - Proceedings of the ACM Web Conference 2022 (pp. 1785–1796). Association for Computing Machinery, Inc. https://doi.org/10.1145/3485447.3511983
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.