Debugging distributed systems

46Citations
Citations of this article
54Readers
Mendeley users who have this article in their library.

Abstract

DISTRIBUTED SYSTEMS POSE unique challenges for software developers. Reasoning about concurrent activities of system nodes and even understanding the system's communication topology can be difficult. A standard approach to gaining insight into system activity is to analyze system logs. Unfortunately, this can be a tedious and complex process. This article looks at several key features and debugging challenges that differentiate distributed systems from other kinds of software. The article presents several promising tools and ongoing research to help resolve these challenges. Distributed systems differ from single-machine programs in ways that are simultaneously positive in providing systems with special capabilities, and negative in presenting software-development and operational challenges.

Cite

CITATION STYLE

APA

Beschastnikh, I., Wang, P., Brun, Y., & Ernst, M. D. (2016). Debugging distributed systems. Communications of the ACM, 59(8), 32–37. https://doi.org/10.1145/2909480

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free