We developed MDGRAPE-4A, a special-purpose computer system for molecular dynamics simulations, consisting of 512 nodes of custom system-ona-chip LSIs with dedicated processor cores and interconnects designed to achieve strong scalability for biomolecular simulations. To reduce the global communications required for the evaluation of Coulomb interactions, we conducted a co-design of the MDGRAPE-4A and the novel algorithm, tensor-structured multilevel Ewald summation method (TME), which produced hardware modules on the custom LSI circuit for particle grid operations and for grid grid separable convolutions on a 3D torus network. We implemented the convolution for the top-level grid potentials by using 3D FFTs on an FPGA, along with an FPGA-based octree network to gather grid charges the elapsed time for the long-range part of Coulomb is 50 us, which can mostly overlap with those for the short-range part, and the additional cost is approximately 10 us/step, which is only a 5% performance loss.
CITATION STYLE
Morimoto, G., Koyama, Y. M., Zhang, H., Komatsu, T. S., Ohno, Y., Nishida, K., … Taiji, M. (2021). Hardware acceleration of tensor-structured multilevel ewald summation method on mdgrape-4a, a special-purpose computer system for molecular dynamics simulations. In International Conference for High Performance Computing, Networking, Storage and Analysis, SC. IEEE Computer Society. https://doi.org/10.1145/3458817.3476190
Mendeley helps you to discover research relevant for your work.