Dynamic cluster assignment mechanisms

72Citations
Citations of this article
16Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Clustered microarchitectures are an effective approach to reducing the penalties caused by wire delays inside a chip. Current superscalar processors have in fact a two-cluster microarchitecture with a naive code partitioning approach: integer instructions are allocated to one cluster and floating-point instructions to the other. This partitioning scheme is simple and results in no communications between the two clusters (just through memory) but it is in general far from optimal because the workload is not evenly distributed most of the time. In fact, when the processor is running integer programs, the workload is extremely unbalanced since the FP cluster is not used at all. In this work we investigate run-time mechanisms that dynamically distribute the instructions of a program among these two clusters. By optimizing the trade-off between inter-cluster communication penalty and workload balance, the proposed schemes can achieve an average speed-up of 36% for the SpecInt95 benchmark suite.

Cite

CITATION STYLE

APA

Canal, R., Parcerisa, J. M., & González, A. (2000). Dynamic cluster assignment mechanisms. In IEEE High-Performance Computer Architecture Symposium Proceedings (pp. 133–142). https://doi.org/10.1109/hpca.2000.824345

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free