4.7 Article

Computing the Density Matrix in Electronic Structure Theory on Graphics Processing Units

Journal

JOURNAL OF CHEMICAL THEORY AND COMPUTATION
Volume 8, Issue 11, Pages 4094-4101

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/ct300442w

Keywords

-

Funding

  1. LDRD program at Los Alamos National Laboratory
  2. Office of Advanced Cyberinfrastructure (OAC)
  3. Direct For Computer & Info Scie & Enginr [0910735] Funding Source: National Science Foundation

Ask authors/readers for more resources

The Self-consistent solution of a Schrodinger-like equation for the density matrix is a critical and computationally demanding step in quantum based models of interatomic bonding. This step was tackled historically via the diagonalization of the Hamiltonian. We have investigated the performance and accuracy of the second-order spectral projection (SP2) algorithm for the computation of the density matrix via a recursive expansion of the Fermi operator in a series of generalized matrix-matrix multiplications. We demonstrate that owing to its simplicity, the SP2 algorithm [Niklasson, A. M. N. Phys. Rev. B 2002, 66, 155115] is exceptionally well suited to implementation on graphics processing units (GPUs). The performance in double and single precision arithmetic of a hybrid GPU/central processing unit (CPU), and full GPU implementation of the SP2 algorithm exceed those of a CPU only implementation of the SP2 algorithm and traditional matrix diagonalization when the dimensions of the matrices exceed about 2000 x 2000. Padding schemes for arrays allocated hi the GPU memory that optimize the performance. Of the CUBLAS implementations of the level 3 BLAS DGEMM and SGEMM subroutines for generalized matrix-matrix multiplications are described in detail. The analysis Of the relative performance of the hybrid CPU/GPU and full GPU implementations indicate that the transfer of arrays between the GPU and CPU constitutes only a small fraction Of the total computation time The errors measured in the self consistent density matrices computed using the SP2 algorithm are generally smaller than those measured in matrices computed via diagonalization. Furthermore, the errors in the density matrices computed using the SP2 algorithm do not exhibit any dependence of system size, whereas the errors increase linearly with the number of orbitals when diagonalization is employed.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available