Addressing GPU on-chip shared memory bank conflicts using elastic pipeline

Chunyang Gou; Georgi N. Gaydadjiev

Conference ProceedingsOPEN ACCESS

Addressing GPU on-chip shared memory bank conflicts using elastic pipeline

International Journal of Parallel Programming (2013) 41(3) 400-429

DOI: 10.1007/s10766-012-0201-1

11Citations

23Readers

Abstract

One of the major problems with the GPU on-chip shared memory is bank conflicts. We analyze that the throughput of the GPU processor core is often constrained neither by the shared memory bandwidth, nor by the shared memory latency (as long as it stays constant), but is rather due to the varied latencies caused by memory bank conflicts. This results in conflicts at the writeback stage of the in-order pipeline and causes pipeline stalls, thus degrading system throughput. Based on this observation, we investigate and propose a novel Elastic Pipeline design that minimizes the negative impact of on-chip memory bank conflicts on system throughput, by decoupling bank conflicts from pipeline stalls. Simulation results show that our proposed Elastic Pipeline together with the co-designed bank-conflict aware warp scheduling reduces the pipeline stalls by up to 64.0 % (with 42.3 % on average) and improves the overall performance by up to 20.7 % (on average 13.3 %) for representative benchmarks, at trivial hardware overhead. © 2012 The Author(s).

Author supplied keywords

Cite

CITATION STYLE

APA

Gou, C., & Gaydadjiev, G. N. (2013). Addressing GPU on-chip shared memory bank conflicts using elastic pipeline. In International Journal of Parallel Programming (Vol. 41, pp. 400–429). https://doi.org/10.1007/s10766-012-0201-1

Addressing GPU on-chip shared memory bank conflicts using elastic pipeline

Abstract

Author supplied keywords

Cite

Register to see more suggestions