Abstract
Dynamic parallelism allows GPU kernels to launch additional kernels at runtime directly from the GPU. In this paper, we show that dynamic parallelism enables relatively simple high-performance graph algorithms for GPUs. We present breadth-first search (BFS) and single-source shortest paths (SSSP) algorithms that use dynamic parallelism to adapt to the irregular and data-driven nature of these problems. Our approach results in simple code that closely follows the high-level description of the algorithms but yields performance competitive with the current state of the art.
Cite
Zhang, P., Holk, E., Matty, J., Misurda, S., Zalewski, M., Chu, J., … Lumsdaine, A. (2015). Dynamic parallelism for simple and efficient GPU graph algorithms. In Proceedings of the 5th Workshop on Irregular Applications: Architectures and Algorithms, IA3 2015. Association for Computing Machinery, Inc. https://doi.org/10.1145/2833179.2833189