Adaptively mapping code in an intelligent memory architecture

Yan Solihin; Jaejin Lee; Josep Torrellas

Conference Proceedings

Adaptively mapping code in an intelligent memory architecture

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2001) 2107 71-84

DOI: 10.1007/3-540-44570-6_5

0Citations

1Readers

Get full text

Abstract

This paper presents an algorithm to automatically map code to a generic Processor-In-Memory (PIM) system that consists of a host processor and a much simpler memory processor. To achieve high performance with this type of architecture, code needs to be partitioned and scheduled such that each section is assigned to the processor on which it runs most efficiently. In addition, processors should overlap their execution as much as possible. Our algorithm is embedded in a compiler and run-time system and maps applications fully automatically using both static and dynamic information. Using a set of applications and a simulated architecture, we show average speedups of 1.7 over a single host with plain memory. The speedups are very close and often higher than ideal speedups on a more expensive multiprocessor system composed of two identical host processors. Our work shows that heterogeneity can be cost-effectively exploited, and represents one step toward effectively mapping code to more advanced PIM systems.

Cite

CITATION STYLE

APA

Solihin, Y., Lee, J., & Torrellas, J. (2001). Adaptively mapping code in an intelligent memory architecture. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2107, pp. 71–84). Springer Verlag. https://doi.org/10.1007/3-540-44570-6_5

Adaptively mapping code in an intelligent memory architecture

Abstract

Cite

Register to see more suggestions