Platform-independent cache optimization by pinpointing low-locality reuse

Kristof Beyls; Erik H. D'Hollander

Journal ArticleOPEN ACCESS

Platform-independent cache optimization by pinpointing low-locality reuse

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2004) 3038 448-455

DOI: 10.1007/978-3-540-24688-6_59

4Citations

3Readers

Abstract

For many applications, cache misses are the primary performance bottleneck. Even though much research has been performed on automatically optimizing cache behavior at the hardware and the compiler level, many program executions remain dominated by cache misses. Therefore, we propose to let the programmer optimize, who has a better high-level program overview, needed to resolve many cache problems. In order to assist the programmer, a visualization of memory accesses with poor locality is developed. The aim is to indicate causes of cache misses independent of actual cache parameters such as associativity or size. In that way, the programmer is steered towards platform-independent locality optimizations. The visualization was applied to three programs from the SPEC2000 benchmarks. After optimizing the source code based on the visualization, an average speedup of 3.06 was obtained on different platforms with Athlon, Itanium and Alpha processors; indicating the feasibility of platform-independent cache optimizations. © Springer-Verlag 2004.

Cite

CITATION STYLE

APA

Beyls, K., & D’Hollander, E. H. (2004). Platform-independent cache optimization by pinpointing low-locality reuse. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3038, 448–455. https://doi.org/10.1007/978-3-540-24688-6_59

Platform-independent cache optimization by pinpointing low-locality reuse

Abstract

Cite

Register to see more suggestions