4.6 Article

Exploration of transferable and uniformly accurate neural network interatomic potentials using optimal experimental design

Journal

Publisher

IOP Publishing Ltd
DOI: 10.1088/2632-2153/abe294

Keywords

molecular machine learning; atomistic neural networks; active learning; optimal experimental design; computational chemistry

Funding

  1. Studienstiftung des deutschen Volkes (German National Academic Foundation)
  2. Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany's Excellence Strategy [EXC 2075 -390740016]
  3. Stuttgart Center for Simulation Science (SimTech)

Ask authors/readers for more resources

The paper introduces a novel active learning approach that utilizes the output variance of the estimated model within the framework of optimal experimental design, offering advantages in predictive power and computational efficiency compared to established methods.
Machine learning has been proven to have the potential to bridge the gap between the accuracy of ab initio methods and the efficiency of empirical force fields. Neural networks are one of the most frequently used approaches to construct high-dimensional potential energy surfaces. Unfortunately, they lack an inherent uncertainty estimation which is necessary for efficient and automated sampling through the chemical and conformational space to find extrapolative configurations. The identification of the latter is needed for the construction of transferable and uniformly accurate potential energy surfaces. In this paper, we propose an active learning approach that uses the estimated model's output variance derived in the framework of the optimal experimental design. This method has several advantages compared to the established active learning approaches, e.g. Query-by-Committee, Monte Carlo dropout, feature and latent distances, in terms of the predictive power and computational efficiency. We have shown that the application of the proposed active learning scheme leads to transferable and uniformly accurate potential energy surfaces constructed using only a small fraction of data points. Additionally, it is possible to define a natural threshold value for the proposed uncertainty metric which offers the possibility to generate highly informative training data on-the-fly.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available