4.6 Article

Accurate and rapid prediction of pKa of transition metal complexes: semiempirical quantum chemistry with a data-augmented approach

期刊

出版社

ROYAL SOC CHEMISTRY
DOI: 10.1039/d0cp05281g

关键词

-

资金

  1. ARC-CBBC project [2016.008]
  2. European Research Council (ERC) under the European Union [725686]
  3. SURF Cooperative

向作者/读者索取更多资源

The study presents a data-augmented approach to substantially improve the accuracy of the GFN2-xTB method for predicting thermochemical properties of transition metal complexes, using pK(a) values of TM hydrides as a representative model. By constructing a comprehensive database and refining the results with DFT calculations, the study shows a high predictive power of the AutoML model for estimating pK(a) values, indicating potential applications in high-throughput computational screening workflows for homogeneous TM-based catalysts.
Rapid and accurate prediction of reactivity descriptors of transition metal (TM) complexes is a major challenge for contemporary quantum chemistry. The recently-developed GFN2-xTB method based on the density functional tight-binding theory (DFT-B) is suitable for high-throughput calculation of geometries and thermochemistry for TM complexes albeit with moderate accuracy. Herein we present a data-augmented approach to improve substantially the accuracy of the GFN2-xTB method for the prediction of thermochemical properties using pK(a) values of TM hydrides as a representative model example. We constructed a comprehensive database for ca. 200 TM hydride complexes featuring the experimentally measured pK(a) values as well as the GFN2-xTB-optimized geometries and various computed electronic and energetic descriptors. The GFN2-xTB results were further refined and validated by DFT calculations with the hybrid PBE0 functional. Our results show that although the GFN2-xTB performs well in most cases, it fails to adequately describe TM complexes featuring multicarbonyl and multihydride ligand environments. The dataset was analyzed with the ordinary least squares (OLS) fitting and was used to construct an automated machine learning (AutoML) approach for the rapid estimation of pK(a) of TM hydride complexes. The results obtained show a high predictive power of the very fast AutoML model (RMSE similar to 2.7) comparable to that of the much slower DFT calculations (RMSE similar to 3). The presented data-augmented quantum chemistry-based approach is promising for high-throughput computational screening workflows of homogeneous TM-based catalysts.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据