Various hardware-based Erasure Coding (EC) schemes have been proposed [5, 6, 8, 12–14] to leverage the advanced compute capabilities on modern data centers. Currently, there is no unified and easy way for distributed storage systems to fully exploit multiple devices such as CPUs, GPUs, and network devices (i.e., multi-rail support) to perform EC operations in parallel. In this paper, we validate that it is time to design an unified library to efficiently exploit heterogeneous EC coders. HDFS co-designed with our proposed library outperforms the write performance of replication scheme and the default HDFS EC coder by 2.7x - 6.1x and 2.4x - 3.3x, respectively, and improves the performance of read with failure recoveries by up to 2.6x and 5.1x compared to the replication scheme and the default HDFS EC coder, respectively.
CITATION STYLE
Shi, H., Lu, X., Shankar, D., & Panda, D. K. (2018). High-performance multi-rail erasure coding library over modern data center architectures: Early experiences. In SoCC 2018 - Proceedings of the 2018 ACM Symposium on Cloud Computing (p. 530). Association for Computing Machinery, Inc. https://doi.org/10.1145/3267809.3275472
Mendeley helps you to discover research relevant for your work.