Many slowdown models have been proposed to characterize memory interference ofworkloads co-running on heterogeneous Systemon- Chips (SoCs). But they are mostly for post-silicon usage. How to effectively consider memory interference in the SoC design stage remains an open problem. This paper presents a new approach to this problem, consisting of a novel processor-centric slowdown modeling methodology and a new three-region interference-conscious slowdown model. The modeling process needs no measurement of corunning of various combinations of applications, but the produced slowdown models can be used to estimate the co-run slowdowns of arbitrary workloads on various SoC designs that embed a newer generation of accelerators, such as deep learning accelerators (DLA), in addition to CPUs and GPUs. The new method reduces average prediction errors of the state-of-art model from 30.3% to 8.7% on GPU, from 13.4% to 3.7% on CPU, from 20.6% to 5.6% on DLA and demonstrates much improved efficacy in guiding SoC designs.
CITATION STYLE
Xu, Y., Belviranli, M. E., Shen, X., & Vetter, J. (2021). PCCS: Processor-centric contention-aware slowdown model for heterogeneous system-on-chips. In Proceedings of the Annual International Symposium on Microarchitecture, MICRO (pp. 1282–1295). IEEE Computer Society. https://doi.org/10.1145/3466752.3480101
Mendeley helps you to discover research relevant for your work.