4.7 Article

Faster Self-Consistent Field (SCF) Calculations on GPU Clusters

期刊

JOURNAL OF CHEMICAL THEORY AND COMPUTATION
卷 17, 期 12, 页码 7486-7503

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.jctc.1c00720

关键词

-

资金

  1. Department of Energy Exascale Computing Project (ECP) [17-SC-20-SC]

向作者/读者索取更多资源

The novel implementation of the self-consistent field (SCF) procedure is specifically designed for high-performance execution on multiple graphics processing units (GPUs), showing remarkable speedups for various test molecules and basis sets compared to existing GPU codes. Strong scaling calculations demonstrate nearly ideal scalability up to 8 GPUs while maintaining high parallel efficiency for up to 18 GPUs.
A novel implementation of the self-consistent field (SCF) procedure specifically designed for high-performance execution on multiple graphics processing units (GPUs) is presented. The algorithm offloads to GPUs the three major computational stages of the SCF, namely, the calculation of one-electron integrals, the calculation and digestion of electron repulsion integrals, and the diagonalization of the Fock matrix, including SCF acceleration via DIIS. Performance results for a variety of test molecules and basis sets show remarkable speedups with respect to the state-of-the-art parallel GAMESS CPU code and relative to other widely used GPU codes for both single and multi-GPU execution. The new code outperforms all existing multi-GPU implementations when using eight V100 GPUs, with speedups relative to Terachem ranging from 1.2X to 3.3X and speedups of up to 28X over QUICK on one GPU and 15x using eight GPUs. Strong scaling calculations show nearly ideal scalability up to 8 GPUs while retaining high parallel efficiency for up to 18 GPUs.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据