Coarse-Grained Reconfigurable Architecture (CGRA) is a high-performance computing architecture. However, existing CGRA silicon utilization is low due to the lack of fine-grained parallelism inside Processing Element (PE) and general coarse-grained parallel approach on PE array. No fine-grained parallelism in PE not only leads to low silicon utilization of PE, but also makes the mapping loose and irregular. No generalized parallel method for the mapping cause low PE utilization on CGRA. Our goal is to design an execution model and a Mixed-granularity Parallel CGRA (MP-CGRA), which is capable to fine-grained parallelize operators excution in PEs and parallelize data transmission in channels, leading to a compact mapping. A coarse-grained general parallel method is proposed to vectorize the compact mapping. Evaluated with Machsuite, MP-CGRA achieves an improvement of 104.65% silicon utilization on PE array and a 91.40% performance per area improvement compared with baseline-CGRA.
CITATION STYLE
Deng, J., Zhang, L., Wang, L., Liu, J., Deng, K., Tang, S., … Yin, S. (2022). Mixed-granularity parallel coarse-grained reconfigurable architecture. In Proceedings - Design Automation Conference (pp. 343–348). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1145/3489517.3530454
Mendeley helps you to discover research relevant for your work.