Attaining performance in the evaluation of two-electron repulsion integrals and constructing the Fock matrix is of considerable importance to the computational chemistry community. Due to its numerical complexity improving the performance behavior across a variety of leading supercomputing platforms is an increasing challenge due to the significant diversity in high-performance computing architectures. In this paper, we present our successful tuning methodology for these important numerical methods on the Cray XE6, the Cray XC30, the IBM BG/Q, as well as the Intel Xeon Phi. Our optimization schemes leverage key architectural features including vectorization and simultaneous multithreading, and results in speedups of up to 2.5x compared with the original implementation.
CITATION STYLE
Shan, H., Austin, B., De Jong, W., Oliker, L., Wright, N. J., & Apra, E. (2014). Performance tuning of fock matrix and two-electron integral calculations for NWchem on leading HPC platforms. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8551, pp. 261–280). Springer Verlag. https://doi.org/10.1007/978-3-319-10214-6_13
Mendeley helps you to discover research relevant for your work.