As compared to a complex single processor based system, on-chip multiprocessors are less complex, more power efficient, and easier to test and validate. In this work, we focus on an on-chip multiprocessor where each processor has a local memory (or cache). We demonstrate that, in such an architecture, allowing each processor to do off-chip memory requests on behalf of other processors can improve overall performance over a straightforward strategy, where each processor performs off-chip requests independently. Our experimental results obtained using six benchmark codes indicate large execution cycle savings over a wide range of architectural configurations. © Springer-Verlag 2003.
CITATION STYLE
Chen, G., Kandemir, M., Kolcu, I., & Choudhary, A. (2004). Exploiting on-chip data transfers for improving performance of chip-scale multiprocessors. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2790, 271–278. https://doi.org/10.1007/978-3-540-45209-6_41
Mendeley helps you to discover research relevant for your work.