PCCS: Processor-centric contention-aware slowdown model for heterogeneous system-on-chips

6Citations
Citations of this article
15Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Many slowdown models have been proposed to characterize memory interference ofworkloads co-running on heterogeneous Systemon- Chips (SoCs). But they are mostly for post-silicon usage. How to effectively consider memory interference in the SoC design stage remains an open problem. This paper presents a new approach to this problem, consisting of a novel processor-centric slowdown modeling methodology and a new three-region interference-conscious slowdown model. The modeling process needs no measurement of corunning of various combinations of applications, but the produced slowdown models can be used to estimate the co-run slowdowns of arbitrary workloads on various SoC designs that embed a newer generation of accelerators, such as deep learning accelerators (DLA), in addition to CPUs and GPUs. The new method reduces average prediction errors of the state-of-art model from 30.3% to 8.7% on GPU, from 13.4% to 3.7% on CPU, from 20.6% to 5.6% on DLA and demonstrates much improved efficacy in guiding SoC designs.

Cite

CITATION STYLE

APA

Xu, Y., Belviranli, M. E., Shen, X., & Vetter, J. (2021). PCCS: Processor-centric contention-aware slowdown model for heterogeneous system-on-chips. In Proceedings of the Annual International Symposium on Microarchitecture, MICRO (pp. 1282–1295). IEEE Computer Society. https://doi.org/10.1145/3466752.3480101

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free