Optimizing for KNL usage modes when data doesn’t fit in MCDRAM

10Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Technologies such as Multi-Channel DRAM (MCDRAM) or High Bandwidth Memory (HBM) provide significantly more bandwidth than conventional memory. This trend has raised questions about how applications should manage data transfers between levels. This paper focuses on evaluating different usage modes of the MCDRAM in Intel Knights Landing (KNL) manycore processors. We evaluate these usage modes with a sorting kernel and a sorting-based streaming benchmark. We develop a performance model for the benchmark and use experimental evidence to demonstrate the correctness of the model. The model projects near-optimal numbers of copy threads for memory bandwidth bound computations. We demonstrate on KNL up to a 1.9X speedup for sort when the problem does not fit in MCDRAM over an OpenMP GNU sort that does not use MCDRAM.

Cite

CITATION STYLE

APA

Butcher, N., Olivier, S. L., Berry, J., Hammond, S. D., & Kogge, P. M. (2018). Optimizing for KNL usage modes when data doesn’t fit in MCDRAM. In ACM International Conference Proceeding Series. Association for Computing Machinery. https://doi.org/10.1145/3225058.3225116

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free