☆ 4.7 Article

Computing the Density Matrix in Electronic Structure Theory on Graphics Processing Units

JOURNAL OF CHEMICAL THEORY AND COMPUTATION (2012)

Journal

JOURNAL OF CHEMICAL THEORY AND COMPUTATION

Volume 8, Issue 11, Pages 4094-4101

Publisher

AMER CHEMICAL SOC

DOI: 10.1021/ct300442w

Keywords

Funding

LDRD program at Los Alamos National Laboratory
Office of Advanced Cyberinfrastructure (OAC)
Direct For Computer & Info Scie & Enginr [0910735] Funding Source: National Science Foundation

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

The Self-consistent solution of a Schrodinger-like equation for the density matrix is a critical and computationally demanding step in quantum based models of interatomic bonding. This step was tackled historically via the diagonalization of the Hamiltonian. We have investigated the performance and accuracy of the second-order spectral projection (SP2) algorithm for the computation of the density matrix via a recursive expansion of the Fermi operator in a series of generalized matrix-matrix multiplications. We demonstrate that owing to its simplicity, the SP2 algorithm [Niklasson, A. M. N. Phys. Rev. B 2002, 66, 155115] is exceptionally well suited to implementation on graphics processing units (GPUs). The performance in double and single precision arithmetic of a hybrid GPU/central processing unit (CPU), and full GPU implementation of the SP2 algorithm exceed those of a CPU only implementation of the SP2 algorithm and traditional matrix diagonalization when the dimensions of the matrices exceed about 2000 x 2000. Padding schemes for arrays allocated hi the GPU memory that optimize the performance. Of the CUBLAS implementations of the level 3 BLAS DGEMM and SGEMM subroutines for generalized matrix-matrix multiplications are described in detail. The analysis Of the relative performance of the hybrid CPU/GPU and full GPU implementations indicate that the transfer of arrays between the GPU and CPU constitutes only a small fraction Of the total computation time The errors measured in the self consistent density matrices computed using the SP2 algorithm are generally smaller than those measured in matrices computed via diagonalization. Furthermore, the errors in the density matrices computed using the SP2 algorithm do not exhibit any dependence of system size, whereas the errors increase linearly with the number of orbitals when diagonalization is employed.

Computing the Density Matrix in Electronic Structure Theory on Graphics Processing Units

Journal

JOURNAL OF CHEMICAL THEORY AND COMPUTATION

Publisher

AMER CHEMICAL SOC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Computing the Density Matrix in Electronic Structure Theory on Graphics Processing Units

Journal

JOURNAL OF CHEMICAL THEORY AND COMPUTATION

Publisher

AMER CHEMICAL SOC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper