Optimizing inter-data-center large-scale database parallel replication with workload-driven partitioning

Zhen Gao; Hong Min; Xiao Li; Jie Huang; Yi Jin; An Lei; Serge Bourbonnais; Miao Zheng; Gene Fuh

Conference Proceedings

Optimizing inter-data-center large-scale database parallel replication with workload-driven partitioning

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9510 169-192

DOI: 10.1007/978-3-662-49214-7_6

3Citations

2Readers

Get full text

Abstract

Geographically distributed data centers are deployed for non-stop business operations by many enterprises. In case of disastrous events, ongoing workloads must be failed over from the current data center to another active one within just a few seconds to achieve continuous service availability. Softwarebased parallel database replication techniques are designed to meet very high throughput with near-real-time latency. Understanding workload characteristics is one of the key factors for improving replication performance. In this paper, we propose a workload-driven method to optimize database replication latency and minimize transaction splits with a minimum of parallel replication consistency groups. Our two-phased approach includes (1) a log-based mechanism for workload pattern discovery; (2) a history-based algorithm on pattern analysis, database partitioning and partition adjustment. The experimental results from a real banking batch workload and a benchmark OLTP workload demonstrate the effectiveness of the solution even for partitioning 1000 s of database tables in very large workloads. Finally, the algorithm to automate the cyclic flow of workload profile capturing and partitioning readjustment is developed and verified.

Cite

CITATION STYLE

APA

Gao, Z., Min, H., Li, X., Huang, J., Jin, Y., Lei, A., … Fuh, G. (2016). Optimizing inter-data-center large-scale database parallel replication with workload-driven partitioning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9510, pp. 169–192). Springer Verlag. https://doi.org/10.1007/978-3-662-49214-7_6

Optimizing inter-data-center large-scale database parallel replication with workload-driven partitioning

Abstract

Cite

Register to see more suggestions