Accelerating assembly operation in element-by-element FEM on multicore platforms

0Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The speedup of element-by-element FEM algorithms depends not only on the peak processor performance but also on the access time to shared mesh data. Eliminating memory boundness would significantly speedup unstructured mesh computations on hybrid multicore architectures, where the gap between processor and memory performance continues to grow. The speedup can be achieved by ordering unknowns so that only those elements are processed in parallel which do not have common nodes. If vectors are composed with respect to the ordering, memory conflicts will be minimized. Mesh was partitioned into layers by using neighborhood relationship. We evaluated several partitioning schemes (block, odd-even parity, and their modifications) on multicore platforms, using Gunther’s Universal Law of Computational Scalability. We performed numerical experiments with element-by-element matrix-vector multiplication on unstructured meshes on multicore processors accelerated by MIC and GPU. We achieved 5-times speedup on CPU, 40-times — on MIC, and 200-times — on GPU.

Cite

CITATION STYLE

APA

Kopysov, S., Novikov, A., Nedozhogin, N., & Rychkov, V. (2016). Accelerating assembly operation in element-by-element FEM on multicore platforms. In Communications in Computer and Information Science (Vol. 687, pp. 3–14). Springer Verlag. https://doi.org/10.1007/978-3-319-55669-7_1

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free