Abstract
Technologies such as Multi-Channel DRAM (MCDRAM) or High Bandwidth Memory (HBM) provide significantly more bandwidth than conventional memory. This trend has raised questions about how applications should manage data transfers between levels. This paper focuses on evaluating different usage modes of the MCDRAM in Intel Knights Landing (KNL) manycore processors. We evaluate these usage modes with a sorting kernel and a sorting-based streaming benchmark. We develop a performance model for the benchmark and use experimental evidence to demonstrate the correctness of the model. The model projects near-optimal numbers of copy threads for memory bandwidth bound computations. We demonstrate on KNL up to a 1.9X speedup for sort when the problem does not fit in MCDRAM over an OpenMP GNU sort that does not use MCDRAM.
Author supplied keywords
Cite
CITATION STYLE
Butcher, N., Olivier, S. L., Berry, J., Hammond, S. D., & Kogge, P. M. (2018). Optimizing for KNL usage modes when data doesn’t fit in MCDRAM. In ACM International Conference Proceeding Series. Association for Computing Machinery. https://doi.org/10.1145/3225058.3225116
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.