A parallel Lattice Boltzmann Method (pLBM), which is based on hierarchical spatial decomposition, is designed to perform large-scale flow simulations. The algorithm uses critical section-free, dual representation in order to expose maximal concurrency and data locality. Performances of emerging multi-core platforms-PlayStation3 (Cell Broadband Engine) and Compute Unified Device Architecture (CUDA)-are tested using the pLBM, which is implemented with multi-thread and message-passing programming. The results show that pLBM achieves good performance improvement, 11.02 for Cell over a traditional Xeon cluster and 8.76 for CUDA graphics processing unit (GPU) over a Sempron central processing unit (CPU). The results provide some insights into application design on future many-core platforms. © 2008 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Peng, L., Nomura, K. I., Oyakawa, T., Kalia, R. K., Nakano, A., & Vashishta, P. (2008). Parallel lattice boltzmann flow simulation on emerging multi-core platforms. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5168 LNCS, pp. 763–777). https://doi.org/10.1007/978-3-540-85451-7_81
Mendeley helps you to discover research relevant for your work.