Abstract
We describe the implementation and performance results of our massively parallel MPI†/OpenMP‡ hybrid TreePM code for large-scale cosmological N-body simulations. For domain decomposition, a recursive multi-section algorithm is used and the size of domains are automatically set so that the total calculation time is the same for all processes. We developed a highly-tuned gravity kernel for short-range forces, and a novel communication algorithm for long-range forces. For two trillion particles benchmark simulation, the average performance on the fullsystem of K computer (82,944 nodes, the total number of core is 663,552) is 5.8 Pflops, which corresponds to 55% of the peak speed.
Author supplied keywords
Cite
CITATION STYLE
Ishiyama, T. (2016). Supercomputer simulations of structure formation in the Universe. In Proceedings of the International Astronomical Union (Vol. 12, pp. 10–16). Cambridge University Press. https://doi.org/10.1017/S174392131700045X
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.