Performance Diagnosis in Cloud Microservices Using Deep Learning

Li Wu; Jasmin Bogatinovski; Sasho Nedelkoski; Johan Tordsson; Odej Kao

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2021) 12632 LNCS 85-96

DOI: 10.1007/978-3-030-76352-7_13

13 Citations

26 Readers

Abstract

Microservice architectures are increasingly adopted to design large-scale applications. However, the highly distributed nature and complex dependencies of microservices complicate automatic performance diagnosis and make it challenging to guarantee service level agreements (SLAs). In particular, identifying the culprits of a microservice performance issue is extremely difficult because the set of potential root causes is large and issues can manifest themselves in complex ways. This paper presents an application-agnostic system that locates the culprits of microservice performance degradation at fine granularity: it identifies not only the anomalous service from which the performance issue originates but also the culprit metrics that correlate with the service abnormality. Our method first finds potential culprit services by constructing a service dependency graph, and then applies an autoencoder to identify abnormal service metrics based on a ranked list of reconstruction errors. Our experimental evaluation, based on injecting performance anomalies into a microservice benchmark deployed in the cloud, shows that our system achieves good diagnostic results, with 92% precision in locating the culprit service and 85.5% precision in locating the culprit metrics.
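The idea of ranking metrics by reconstruction error can be illustrated with a minimal sketch. It is not the authors' implementation: a linear autoencoder (PCA computed via SVD) stands in for the neural autoencoder, and the metric names, the synthetic data, and the functions `fit_linear_autoencoder` and `rank_metrics` are all hypothetical, chosen only for illustration.

```python
import numpy as np

# Hypothetical metric names for one service (not taken from the paper).
METRICS = ["cpu", "memory", "net_in", "net_out"]


def fit_linear_autoencoder(normal, k):
    """Fit a linear autoencoder (PCA via SVD) on anomaly-free samples.

    Assumption: this replaces the paper's neural autoencoder to keep the
    sketch short and deterministic.
    """
    mean = normal.mean(axis=0)
    std = normal.std(axis=0) + 1e-9            # avoid division by zero
    z = (normal - mean) / std                  # standardize each metric
    _, _, vt = np.linalg.svd(z, full_matrices=False)
    return mean, std, vt[:k]                   # keep the top-k components


def rank_metrics(model, sample, names):
    """Return (metric, squared reconstruction error) pairs, largest first."""
    mean, std, comps = model
    z = (sample - mean) / std
    recon = z @ comps.T @ comps                # encode, then decode
    err = (z - recon) ** 2                     # per-metric error
    order = np.argsort(err)[::-1]
    return [(names[i], float(err[i])) for i in order]


# Synthetic "normal" behaviour: all four metrics follow a shared load level.
rng = np.random.default_rng(0)
load = rng.uniform(0.0, 1.0, size=500)
scales = np.array([1.0, 0.5, 2.0, 1.5])
normal = load[:, None] * scales + rng.normal(0.0, 0.02, size=(500, 4))

model = fit_linear_autoencoder(normal, k=1)

# A sample at medium load whose CPU metric is far above its usual value.
anomalous = 0.5 * scales + np.array([3.0, 0.0, 0.0, 0.0])
ranked = rank_metrics(model, anomalous, METRICS)
print(ranked[0][0])
```

In this sketch the metric that deviates from the learned correlation structure receives the largest reconstruction error, so it appears first in the ranked list; a trained neural autoencoder would play the same role for nonlinear relationships between metrics.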

Cite

Wu, L., Bogatinovski, J., Nedelkoski, S., Tordsson, J., & Kao, O. (2021). Performance Diagnosis in Cloud Microservices Using Deep Learning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12632 LNCS, pp. 85–96). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-76352-7_13
