Accelerating Lattice Boltzmann Method by Fully Exposing Vectorizable Loops

Bin Qu; Song Liu; Hailong Huang; Jiajun Yuan; Qian Wang; Weiguo Wu

Conference Proceedings

Accelerating Lattice Boltzmann Method by Fully Exposing Vectorizable Loops

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2020) 11944 LNCS 107-121

DOI: 10.1007/978-3-030-38991-8_8

0Citations

2Readers

Get full text

Abstract

Lattice Boltzmann Method (LBM) plays an important role in CFD applications. Accelerating LBM computation indicates the decrease of simulation costs for many industries. However, the loop-carried dependencies in LBM kernels prevent the vectorization of loops and general compilers therefore have missed many opportunities of vectorization. This paper proposes a SIMD-aware loop transformation algorithm to fully expose vectorizable loops for LBM kernels. The proposed algorithm identifies most potential vectorizable loops according to a defined dependence table. Then, it performs appropriate loop transformations and array copying techniques to legalize loop-carried dependencies and makes the identified loops automatically vectorized by compiler. Experiments carried on an Intel Xeon Gold 6140 server show that the proposed algorithm significantly raises the ratio of number of vectorized loops to number of all loops in LBM kernels. And our algorithm also achieves a better performance than an Intel C++ compiler and a polyhedral optimizer, accelerating LBM computation by 147% and 120% on average lattice update speed, respectively.

Author supplied keywords

Cite

CITATION STYLE

APA

Qu, B., Liu, S., Huang, H., Yuan, J., Wang, Q., & Wu, W. (2020). Accelerating Lattice Boltzmann Method by Fully Exposing Vectorizable Loops. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11944 LNCS, pp. 107–121). Springer. https://doi.org/10.1007/978-3-030-38991-8_8

Accelerating Lattice Boltzmann Method by Fully Exposing Vectorizable Loops

Abstract

Author supplied keywords

Cite

Register to see more suggestions