4.7 Article

NENCI-2021. I. A large benchmark database of non-equilibrium non-covalent interactions emphasizing close intermolecular contacts

Journal

JOURNAL OF CHEMICAL PHYSICS
Volume 155, Issue 18, Pages -

Publisher

AIP Publishing
DOI: 10.1063/5.0068862

Keywords

-

Funding

  1. Cornell University
  2. National Science Foundation [CHE-1945676]
  3. Alfred P. Sloan Research Fellowship
  4. Office of Science of the U.S. Department of Energy [DE-AC02-05CH11231]

Ask authors/readers for more resources

NENCI-2021 presents a benchmark database of 8000 intermolecular complexes for chemical sciences, systematically sampling potential energy surfaces and close contacts, providing valuable data for testing, training, and developing various chemical simulation methods.
In this work, we present NENCI-2021, a benchmark database of similar to 8000 Non-Equilibirum Non-Covalent Interaction energies for a large and diverse selection of intermolecular complexes of biological and chemical relevance. To meet the growing demand for large and high-quality quantum mechanical data in the chemical sciences, NENCI-2021 starts with the 101 molecular dimers in the widely used S66 and S101 databases and extends the scope of these works by (i) including 40 cation-pi and anion-pi complexes, a fundamentally important class of non-covalent interactions that are found throughout nature and pose a substantial challenge to theory, and (ii) systematically sampling all 141 intermolecular potential energy surfaces (PESs) by simultaneously varying the intermolecular distance and intermolecular angle in each dimer. Designed with an emphasis on close contacts, the complexes in NENCI-2021 were generated by sampling seven intermolecular distances along each PES (ranging from 0.7x to 1.1x the equilibrium separation) and nine intermolecular angles per distance (five for each ion-pi complex), yielding an extensive database of 7763 benchmark intermolecular interaction energies (E-int) obtained at the coupled-cluster with singles, doubles, and perturbative triples/complete basis set [CCSD(T)/CBS] level of theory. The E-int values in NENCI-2021 span a total of 225.3 kcal/mol, ranging from -38.5 to +186.8 kcal/mol, with a mean (median) E-int value of -1.06 kcal/mol (-2.39 kcal/mol). In addition, a wide range of intermolecular atom-pair distances are also present in NENCI-2021, where close intermolecular contacts involving atoms that are located within the so-called van der Waals envelope are prevalent-these interactions, in particular, pose an enormous challenge for molecular modeling and are observed in many important chemical and biological systems. A detailed symmetry-adapted perturbation theory (SAPT)-based energy decomposition analysis also confirms the diverse and comprehensive nature of the intermolecular binding motifs present in NENCI-2021, which now includes a significant number of primarily induction-bound dimers (e.g., cation-pi complexes). NENCI-2021 thus spans all regions of the SAPT ternary diagram, thereby warranting a new four-category classification scheme that includes complexes primarily bound by electrostatics (3499), induction (700), dispersion (1372), or mixtures thereof (2192). A critical error analysis performed on a representative set of intermolecular complexes in NENCI-2021 demonstrates that the E-int values provided herein have an average error of +/- 0.1 kcal/mol, even for complexes with strongly repulsive E-int values, and maximum errors of +/- 0.2-0.3 kcal/mol (i.e., similar to +/- 1.0 kJ/mol) for the most challenging cases. For these reasons, we expect that NENCI-2021 will play an important role in the testing, training, and development of next-generation classical and polarizable force fields, density functional theory approximations, wavefunction theory methods, and machine learning based intra- and inter-molecular potentials.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available