4.8 Article

Semi-supervised Machine Learning Enables the Robust Detection of Multireference Character at Low Cost

期刊

JOURNAL OF PHYSICAL CHEMISTRY LETTERS
卷 11, 期 16, 页码 6640-6648

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.jpclett.0c02018

关键词

-

资金

  1. Office of Naval Research [N00014-17-1-2956, N00014-18-1-2434, N00014-20-1-2150]
  2. Department of Energy [DE-SC0018096]
  3. National Science Foundation [ACI-1548562]
  4. Burroughs Wellcome Fund
  5. AAAS Marion Milligan Mason Award
  6. MolSSI fellowship [ACI-1547580]

向作者/读者索取更多资源

Multireference (MR) diagnostics are common tools for identifying strongly correlated electronic structure that makes single-reference (SR) methods (e.g., density functional theory or DFT) insufficient for accurate property prediction. However, MR diagnostics typically require computationally demanding correlated wave function theory (WFT) calculations, and diagnostics often disagree or fail to predict MR effects on properties. To overcome these challenges, we introduce a semi-supervised machine learning (ML) approach with virtual adversarial training (VAT) of an MR classifier using 15 WFT and DFT MR diagnostics as inputs. In semi-supervised learning, only the most extreme SR or MR points are labeled, and the remaining point labels are learned. The resulting VAT model outperforms the alternatives, as quantified by the distinct property distributions of SR- and MR-classified molecules. To reduce the cost of generating inputs to the VAT model, we leverage the VAT model's robustness to noisy inputs by replacing WFT MR diagnostics with regression predictions in an MR decision engine workflow that preserves excellent performance. We demonstrate the transferability of our approach to larger molecules and those with distinct chemical composition from the training set. This MR decision engine demonstrates promise as a low-cost, high-accuracy approach to the automatic detection of strong correlation for predictive high-throughput screening.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据