ArkDB: A Key-Value Engine for Scalable Cloud Storage Services

10Citations
Citations of this article
23Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Persistent key-value stores play a crucial role in enabling internet-scale services. At Alibaba Cloud, scale-out cloud storage services including Object Storage Service, File Storage Service and Tablestore are built on distributed key-value stores. Key challenges in the design of the underlying key-value engine for these services lie in utilization of disaggregated storage, supporting write and range query-heavy workloads, and balancing of scalability, availability and resource usage. This paper presents ArkDB, a key-value engine designed to address these challenges by combining advantages of both LSM tree and Bw-tree, and leveraging advances in hardware technologies. Built on top of Pangu, an append-only distributed file system, ArkDB's innovations include shrinkable page mapping table, clear separation of system and user states for fast recovery, write amplification reduction, efficient garbage collection and lightweight partition split and merge. Experimental results demonstrate ArkDB's improvements over existing designs. Compared with Bw-tree, ArkDB efficiently stabilizes the mapping table size despite continuous write working set growth. Compared with RocksDB, an LSM tree-based key-value engine, ArkDB increases ingestion throughput by 2.16x, while reducing write amplification by 3.1x. It outperforms RocksDB by 52% and 37% respectively on a write-heavy workload and a range query-intensive workload of the Yahoo! Cloud Serving Benchmark. Experiments running in Tablestore in a cluster environment further demonstrate ArkDB's performance on Pangu and its efficient partition split/merge support.

References Powered by Scopus

The google file system

4518Citations
N/AReaders
Get full text

The Hadoop distributed file system

3911Citations
N/AReaders
Get full text

Benchmarking cloud serving systems with YCSB

2957Citations
N/AReaders
Get full text

Cited by Powered by Scopus

A maturity model for AI-empowered cloud-native databases: from the perspective of resource management

3Citations
N/AReaders
Get full text

Mask–Mediator–Wrapper: A Revised Mediator–Wrapper Architecture for Heterogeneous Data Source Integration

2Citations
N/AReaders
Get full text

LETUS: A Log-Structured Efficient Trusted Universal BlockChain Storage

1Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Pang, Z., Lu, Q., Chen, S., Wang, R., Xu, Y., & Wu, J. (2021). ArkDB: A Key-Value Engine for Scalable Cloud Storage Services. In Proceedings of the ACM SIGMOD International Conference on Management of Data (pp. 2570–2583). Association for Computing Machinery. https://doi.org/10.1145/3448016.3457553

Readers over time

‘21‘22‘23‘24036912

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 6

55%

Researcher 5

45%

Readers' Discipline

Tooltip

Computer Science 10

83%

Engineering 1

8%

Earth and Planetary Sciences 1

8%

Save time finding and organizing research with Mendeley

Sign up for free
0