Matrix Multiplication on Digital Signal Processors and Hierarchical Memory Systems

  • Palacios I
  • Medina M
  • Moreno J
N/ACitations
Citations of this article
1Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We discuss mapping the matrix multiplication algorithm onto a two-level hierarchical memory system which incorporates DMA capabilities between levels, as available on Digital Signal Processors (DSPs). We show that it is possible to hide the hierarchical nature of the memory system from the processor, so that computations can proceed at the processor's speed. This is accomplished by the use of a block algorithm, and by prefetching data from the slower second-level memory into the faster but smaller first-level memory under DMA control. The Texas Instruments TMS 320C30 Digital Signal Processor is used as an example, and performance estimates for different memory timings are given. These results are also compared to the performance of executing the matrix multiplication algorithm without exploiting the DMA capabilities.

Cite

CITATION STYLE

APA

Palacios, I., Medina, M., & Moreno, J. (1992). Matrix Multiplication on Digital Signal Processors and Hierarchical Memory Systems. In Computer Science (pp. 473–483). Springer US. https://doi.org/10.1007/978-1-4615-3422-8_39

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free