POSTER: Provably Efficient Scheduling of Cache-Oblivious Wavefront Algorithms

Abstract

Standard cache-oblivious recursive divide-and-conquer algorithms for evaluating dynamic programming recurrences have optimal serial cache complexity, but they often have lower parallelism than iterative wavefront algorithms because of artificial dependencies among subtasks. Recently introduced cache-oblivious recursive wavefront (COW) algorithms do not have any artificial dependencies. Although COW algorithms are based on fork-join primitives, they use atomic operations extensively, and as a result the performance guarantees provided by state-of-the-art schedulers for programs with fork-join primitives do not apply. In this work, we show how to systematically transform standard cache-oblivious recursive divide-and-conquer algorithms into recursive wavefront algorithms that achieve optimal parallel cache complexity and high parallelism under state-of-the-art schedulers for fork-join programs. Unlike COW algorithms, these new algorithms do not use atomic operations. Instead, they use closed-form formulas to compute the time at which each recursive function must be launched in order to achieve high parallelism without losing cache performance. The resulting implementations are arguably much simpler than implementations of known COW algorithms.
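To make the notion of artificial dependencies concrete, the following is a minimal Python sketch and not the authors' method. It assumes an LCS-like recurrence in which block (i, j) depends only on blocks (i-1, j), (i, j-1), and (i-1, j-1), and it assumes each base-case block takes one time step. The names `dac_schedule` and `wavefront_time` are hypothetical, and the closed form `i + j` applies only to this simple dependency pattern; the formulas in the paper are not reproduced here.

```python
def dac_schedule(k):
    """Start step of every base-case block in a 2^k x 2^k grid under the
    standard recursive schedule: top-left quadrant first, then top-right
    and bottom-left in parallel, then bottom-right."""
    start = {}

    def rec(i0, j0, size, t0):
        # Returns the number of steps this subproblem occupies.
        if size == 1:
            start[(i0, j0)] = t0
            return 1
        h = size // 2
        d = rec(i0, j0, h, t0)              # top-left quadrant
        rec(i0, j0 + h, h, t0 + d)          # top-right, in parallel with
        rec(i0 + h, j0, h, t0 + d)          # bottom-left
        rec(i0 + h, j0 + h, h, t0 + 2 * d)  # bottom-right waits for both
        return 3 * d

    total_steps = rec(0, 0, 2 ** k, 0)
    return start, total_steps


def wavefront_time(i, j):
    """Assumed closed form for this dependency pattern: the earliest step
    at which block (i, j) can start once its three neighbours are done."""
    return i + j


if __name__ == "__main__":
    for k in range(1, 5):
        n = 2 ** k
        _, dac_steps = dac_schedule(k)
        wave_steps = wavefront_time(n - 1, n - 1) + 1
        print(f"n={n}: recursive steps={dac_steps}, wavefront steps={wave_steps}")
```

Under these assumptions the recursive schedule needs 3^k steps for a 2^k x 2^k grid, while the wavefront needs only 2·2^k - 1, because the recursive order makes a whole quadrant wait for an entire sibling quadrant even when only a few boundary blocks are actually required.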

Cite

Chowdhury, R., Ganapathi, P., Tang, Y., & Tithi, J. J. (2017). POSTER: Provably Efficient Scheduling of Cache-Oblivious Wavefront Algorithms. ACM SIGPLAN Notices, 52(8), 435–436. https://doi.org/10.1145/3018743.3019031
