4.7 Article

Challenges for machine learning force fields in reproducing potential energy surfaces of flexible molecules

Journal

JOURNAL OF CHEMICAL PHYSICS
Volume 154, Issue 9, Pages -

Publisher

AIP Publishing
DOI: 10.1063/5.0038516

Keywords

-

Funding

  1. Luxembourg National Research Fund (FNR) under the program DTU PRIDE MASSENA [PRIDE/15/10935404]
  2. FNR AFR [14593813]
  3. European Research Council (ERC-CoG BeStMo)
  4. FNR [C19/MS/13718694/QML-FLEX]

Ask authors/readers for more resources

The study evaluates the performance of machine learning models in reproducing complex potential-energy surfaces, finding that predictions greatly depend on the ML method used and the local region of the PES being sampled. It suggests switching from learning the entire PES within a single model to using multiple local models for different parts of the complex PES.
Dynamics of flexible molecules are often determined by an interplay between local chemical bond fluctuations and conformational changes driven by long-range electrostatics and van der Waals interactions. This interplay between interactions yields complex potential-energy surfaces (PESs) with multiple minima and transition paths between them. In this work, we assess the performance of the state-of-the-art Machine Learning (ML) models, namely, sGDML, SchNet, Gaussian Approximation Potentials/Smooth Overlap of Atomic Positions (GAPs/SOAPs), and Behler-Parrinello neural networks, for reproducing such PESs, while using limited amounts of reference data. As a benchmark, we use the cis to trans thermal relaxation in an azobenzene molecule, where at least three different transition mechanisms should be considered. Although GAP/SOAP, SchNet, and sGDML models can globally achieve a chemical accuracy of 1 kcal mol(-1) with fewer than 1000 training points, predictions greatly depend on the ML method used and on the local region of the PES being sampled. Within a given ML method, large differences can be found between predictions of close-to-equilibrium and transition regions, as well as for different transition mechanisms. We identify key challenges that the ML models face mainly due to the intrinsic limitations of commonly used atom-based descriptors. All in all, our results suggest switching from learning the entire PES within a single model to using multiple local models with optimized descriptors, training sets, and architectures for different parts of the complex PES.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available