Automated accelerator generation and optimization with composable, parallel and pipeline architecture

42Citations
Citations of this article
52Readers
Mendeley users who have this article in their library.

Abstract

CPU-FPGA heterogeneous architectures feature flexible acceleration of many workloads to advance computational capabilities and energy efficiency in today's datacenters. This advantage, however, is often overshadowed by the poor programmability of FPGAs. Although recent advances in high-level synthesis (HLS) significantly improve the FPGA programmability, it still leaves programmers facing the challenge of identifying the optimal design configuration in a tremendous design space. In this paper we propose the composable, parallel and pipeline (CPP) microarchitecture as an accelerator design template to substantially reduce the design space. Also, by introducing the CPP analytical model to capture the performanceresource trade-offs, we achieve efficient, analytical-based design space exploration. Furthermore, we develop the AutoAccel framework to automate the entire accelerator generation process. Our experiments show that the AutoAccel-generated accelerators outperform their corresponding software implementations by an average of 72x for a broad class of computation kernels.

References Powered by Scopus

A general method applicable to the search for similarities in the amino acid sequence of two proteins

8694Citations
N/AReaders
Get full text

A reconfigurable fabric for accelerating large-scale datacenter services

866Citations
N/AReaders
Get full text

High-level synthesis for FPGAs: From prototyping to deployment

650Citations
N/AReaders
Get full text

Cited by Powered by Scopus

HeteroCL: A multi-paradigm programming infrastructure for software-defined reconfigurable computing

107Citations
N/AReaders
Get full text

ThunderGP: HLS-based graph processing framework on fpgas

73Citations
N/AReaders
Get full text

AutoDSE: Enabling Software Programmers to Design Efficient FPGA Accelerators

42Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Cong, J., Wei, P., Yu, C. H., & Zhang, P. (2018). Automated accelerator generation and optimization with composable, parallel and pipeline architecture. In Proceedings - Design Automation Conference (Vol. Part F137710). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1145/3195970.3195999

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 24

75%

Researcher 4

13%

Lecturer / Post doc 3

9%

Professor / Associate Prof. 1

3%

Readers' Discipline

Tooltip

Computer Science 22

65%

Engineering 10

29%

Arts and Humanities 1

3%

Neuroscience 1

3%

Save time finding and organizing research with Mendeley

Sign up for free