Battling memory requirements of array programming through streaming

Mads R.B. Kristensen; James Avery; Troels Blum; Simon Andreas Frimann Lund; Brian Vinter

Conference Proceedings

Battling memory requirements of array programming through streaming

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9945 LNCS 451-469

DOI: 10.1007/978-3-319-46079-6_32

3Citations

1Readers

Get full text

Abstract

A barrier to efficient array programming, for example in Python/NumPy, is that algorithms written as pure array operations completely without loops, while most efficient on small input, can lead to explosions in memory use. The present paper presents a solution to this problem using array streaming, implemented in the automatic parallelization high-performance framework Bohrium. This makes it possible to use array programming in Python/NumPy code directly, even when the apparent memory requirement exceeds the machine capacity, since the automatic streaming eliminates the temporary memory overhead by performing calculations in per-thread registers. Using Bohrium, we automatically fuse, JIT-compile, and execute NumPy array operations on GPGPUs without modification to the user programs. We present performance evaluations of three benchmarks, all of which show dramatic reductions in memory use from streaming, yielding corresponding improvements in speed and utilization of GPGPUcores. The streaming-enabled Bohrium effortlessly runs programs on input sizes much beyond sizes that crash on pure NumPy due to exhausting system memory.

Cite

CITATION STYLE

APA

Kristensen, M. R. B., Avery, J., Blum, T., Lund, S. A. F., & Vinter, B. (2016). Battling memory requirements of array programming through streaming. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9945 LNCS, pp. 451–469). Springer Verlag. https://doi.org/10.1007/978-3-319-46079-6_32

Battling memory requirements of array programming through streaming

Abstract

Cite

Register to see more suggestions