Abstract
Characterizing compute kernel execution behavior on GPUs for efficient task scheduling is a non-trivial task. We address this with a simple model enabling portable and fast predictions among different GPUs using only hardware-independent features. This model is built based on random forests using 189 individual compute kernels from benchmarks such as Parboil, Rodinia, Polybench-GPU, and SHOC. Evaluation of the model performance using cross-validation yields a median Mean Average Percentage Error (MAPE) of 8.86-52.0% for time and 1.84-2.94% for power prediction across five different GPUs, while latency for a single prediction varies between 15 and 108 ms.
Author supplied keywords
Cite
CITATION STYLE
Braun, L., Nikas, S., Song, C., Heuveline, V., & Fröning, H. (2021). A Simple Model for Portable and Fast Prediction of Execution Time and Power Consumption of GPU Kernels. ACM Transactions on Architecture and Code Optimization, 18(1). https://doi.org/10.1145/3431731
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.