Benchmarking the availability and fault tolerance of Cassandra

1Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.
Get full text

Abstract

To be able to handle big data workloads, modern NoSQL database management systems like Cassandra are designed to scale well over multiple machines. However, with each additional machine in a cluster, the likelihood for hardware failure increases. In order to still achieve high availability and fault tolerance, the data needs to be replicated within the cluster. Predictable and stable response times are required by many applications even in the case of a node failure. While Cassandra guarantees high availability, the influence of a node failure on the system performance is still unclear. In this paper, we therefore focus on the availability and fault tolerance of Cassandra. We analyze the impact of a node outage within a Cassandra cluster on the throughput and latency for different workloads. Our results show that Cassandra is well suited to achieve high availability while preserving table response times in case of a node failure. Especially for read intensive applications that require high availability, Cassandra is a good choice.

Cite

CITATION STYLE

APA

Rosselli, M., Niemann, R., Ivanov, T., Tolle, K., & Zicari, R. V. (2016). Benchmarking the availability and fault tolerance of Cassandra. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10044, pp. 87–95). Springer Verlag. https://doi.org/10.1007/978-3-319-49748-8_5

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free