TRACK: Optimizing Artificial Neural Networks for Anomaly Detection in Spark Streaming Systems

Abstract

With the growth of Big Data processing technologies and cloud computing services, multiple tenants commonly share the same computing resources, which may cause performance anomalies. There is an urgent need for an effective performance anomaly detection method that can be used in production environments to avoid late detection of unexpected system failures. To address this challenge, we introduce TRACK, a new black-box methodology that optimizes the training workload configuration of a neural network used to identify anomalous performance in an in-memory Big Data Spark streaming platform. The methodology uses Bayesian optimization to find the optimal training dataset size and configuration parameters so that the model can be trained efficiently. TRACK is validated on a real Apache Spark streaming system, and the results show that TRACK achieves the highest performance (an F-score of 95%) and reduces the training time by 80% when training the proposed anomaly detection model in the in-memory streaming platform.
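For readers unfamiliar with the metric, the F-score reported in the abstract combines precision and recall into a single number. The sketch below is illustrative only: the abstract does not say which F-score variant was used, so the balanced F1 default (beta = 1) is an assumption, and the function name `f_score` and the confusion counts are hypothetical rather than taken from the paper.

```python
def f_score(true_pos: int, false_pos: int, false_neg: int, beta: float = 1.0) -> float:
    """Return the F-beta score from confusion counts (beta=1 gives F1).

    Assumption: the paper's F-score is the balanced F1; this is not
    confirmed by the abstract.
    """
    predicted_pos = true_pos + false_pos   # everything flagged as anomalous
    actual_pos = true_pos + false_neg      # everything that truly was anomalous
    if predicted_pos == 0 or actual_pos == 0:
        return 0.0
    precision = true_pos / predicted_pos
    recall = true_pos / actual_pos
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


if __name__ == "__main__":
    # Hypothetical counts: 95 anomalies caught, 5 false alarms, 5 missed.
    print(round(f_score(95, 5, 5), 2))
```

With these hypothetical counts, precision and recall are both 0.95, so the F1 score is also 0.95.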

Cite

Alnafessah, A. S., & Casale, G. (2020). TRACK: Optimizing Artificial Neural Networks for Anomaly Detection in Spark Streaming Systems. In ACM International Conference Proceeding Series (pp. 188–191). Association for Computing Machinery. https://doi.org/10.1145/3388831.3388860
