Cache performance optimizations for parallel lattice boltzmann codes

15Citations
Citations of this article
17Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

When designing and implementing highly efficient scientific applications for parallel computers such as clusters of workstations, it is inevitable to consider and to optimize the single-CPU performance of the codes. For this purpose, it is particularly important that the codes respect the hierarchical memory designs that computer architects employ in order to hide the effects of the growing gap between CPU performance and main memory speed. In this paper, we present techniques to enhance the single-CPU efficiency of lattice Boltzmann methods which are commonly used in computational fluid dynamics. We show various performance results to emphasize the effectiveness of our optimization techniques. © Springer-Verlag 2003.

Cite

CITATION STYLE

APA

Wilke, J., Pohl, T., Kowarschik, M., & Rüde, U. (2004). Cache performance optimizations for parallel lattice boltzmann codes. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2790, 441–450. https://doi.org/10.1007/978-3-540-45209-6_66

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free