Processing-in-Memory Acceleration of MAC-based Applications Using Residue Number System: A Comparative Study

6Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Processing-in-memory (PIM) has raised as a viable solution for the memory wall crisis and has attracted great interest in accelerating computationally intensive AI applications ranging from filtering to complex neural networks. In this paper, we try to take advantage of both PIM and the residue number system (RNS) as an alternative for the conventional binary number representation to accelerate multiplication-And-Accumulations (MACs), primary operations of target applications. The PIM architecture utilizes the maximum internal bandwidth of memory chips to realize a local and parallel computation to eliminates the off-chip data transfer. Moreover, RNS limits inter-digit carry propagation by performing arithmetic operations on small residues independently and in parallel. Thus, we develop a PIM-RNS, entitled PRIMS, and analyze the potential of intertwining PIM architecture with the inherent parallelism of the RNS arithmetic to delineate the opportunities and challenges. To this end, we build a comprehensive device-To-Architecture evaluation framework to quantitatively study this problem considering the impact of PIM technology for a well-known three-moduli set as a case study.

Cite

CITATION STYLE

APA

Angizi, S., Roohi, A., Taheri, M., & Fan, D. (2021). Processing-in-Memory Acceleration of MAC-based Applications Using Residue Number System: A Comparative Study. In Proceedings of the ACM Great Lakes Symposium on VLSI, GLSVLSI (pp. 265–270). Association for Computing Machinery. https://doi.org/10.1145/3453688.3461529

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free