4.7 Article

The analysis of a plane wave pseudopotential density functional theory code on a GPU machine

期刊

COMPUTER PHYSICS COMMUNICATIONS
卷 184, 期 1, 页码 9-18

出版社

ELSEVIER
DOI: 10.1016/j.cpc.2012.08.002

关键词

Electronic structure; First-principles; Density functional theory; Plane wave pseudopotential; GPU

资金

  1. National Basic Research Program of China [2010CB832702]
  2. NSF of China [61202054, 10972215, 60873113, 11071047]
  3. Knowledge Innovation Program of CAS [CNIC_ZR_201202]
  4. 863 Program [2010AA012301, 2010AA012402]
  5. 111 Project
  6. Office of Science, Office of Basic Energy Sciences, Materials Science and Engineering Division, of the US Department of Energy (DOE) [DE-AC02-05CH11231]

向作者/读者索取更多资源

Plane wave pseudopotential (PWP) density functional theory (DFT) calculation is the most widely used material science simulation, and the PWP DFT codes are arguably the most important material science codes. We have implemented a PWP DFT code PEtot on a multi-node GPU machine. Starting from a previous work, we have further improved the speed of the code, and achieved x13-x22 speedups over the CPU calculations for a typical 512 atom system. Such speedups are much higher than other similar works for this important class of material simulation codes on CPU clusters. The current achievement is obtained by (1) moving the calculation fully into the GPU; (2) adopting a new algorithm to reduce the data amount for MPI communication; and (3) using new CPU and CPU numerical libraries. We have also provided a detail quantitative analysis of the computational times for different physical systems and number of GPU units, which helps one to understand the challenges and bottlenecks of the PWP OFT simulations on CPU machines. Based on the analysis, we listed the machine and library requirements in order to further improve the performances of the PWP DFT calculations. (C) 2012 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据