MOCHA: Multinode cost optimization in heterogeneous clouds with accelerators

Peipei Zhou; Jiayi Sheng; Cody Hao Yu; Peng Wei; Jie Wang; Di Wu; Jason Cong

Conference ProceedingsOPEN ACCESS

MOCHA: Multinode cost optimization in heterogeneous clouds with accelerators

FPGA 2021 - 2021 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (2021) 273-279

DOI: 10.1145/3431920.3439304

11Citations

14Readers

Get full text

Abstract

FPGAs have been widely deployed in public clouds, e.g., Amazon Web Services (AWS) and Huawei Cloud. However, simply offloading accelerated kernels from CPU hosts to PCIe-based FPGAs does not guarantee out-of-pocket cost savings in a pay-as-you-go public cloud. Taking Genome Analysis Toolkit (GATK) applications as case studies, although the adoption of FPGAs reduces the overall execution time, it introduces 2.56× extra cost, due to insufficient application-level speedup by Amdahl's law. To optimize the out-of-pocket cost while keeping high speedup and throughput, we propose Mocha framework as a distributed runtime system to fully utilize the accelerator resource by accelerator sharing and CPU-FPGA partial task offloading. Evaluation results on Haplotype Caller (HTC) and Mutect2 in GATK show that on AWS, Mocha saves on the application cost by 2.82x for HTC, 1.06x for Mutect2 and on Huawei Cloud by 1.22x, 1.52x respectively than straightforward CPU-FPGA integration solution with less than 5.1% performance overhead.

Author supplied keywords

Cite

CITATION STYLE

APA

Zhou, P., Sheng, J., Yu, C. H., Wei, P., Wang, J., Wu, D., & Cong, J. (2021). MOCHA: Multinode cost optimization in heterogeneous clouds with accelerators. In FPGA 2021 - 2021 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (pp. 273–279). Association for Computing Machinery, Inc. https://doi.org/10.1145/3431920.3439304

MOCHA: Multinode cost optimization in heterogeneous clouds with accelerators

Abstract

Author supplied keywords

Cite

Register to see more suggestions