Abstract
This paper describes a method to improve the cache locality of sequential programs by scheduling fine-grained threads. The algorithm relies upon hints provided at the time of thread creation to determine a thread execution order likely to reduce cache misses. This technique may be particularly valuable when compiler-directed tiling is not feasible. Experiments with several application programs, on two systems with different cache structures, show that our thread scheduling method can improve program performance by reducing second-level cache misses.
Cite
CITATION STYLE
Philbin, J., Edler, J., Anshus, O. J., Douglas, C. C., & Li, K. (1996). Thread scheduling for cache locality. Operating Systems Review (ACM), 30(5), 60–71. https://doi.org/10.1145/248208.237151
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.