4.8 Article

Real-time prediction of 1H and 13C chemical shifts with DFT accuracy using a 3D graph neural network

期刊

CHEMICAL SCIENCE
卷 12, 期 36, 页码 12012-12026

出版社

ROYAL SOC CHEMISTRY
DOI: 10.1039/d1sc03343c

关键词

-

资金

  1. NSF under the CCI Center for Computer Assisted Synthesis [CHE-1925607]
  2. Texas Advanced Computing Center at The University of Texas at Austin, Frontera Fellowships Program - National Science Foundation Award [1818253]
  3. National Science Foundation [ACI-1532235, ACI-1532236, ACI-1548562]
  4. NVIDIA Corporation

向作者/读者索取更多资源

Nuclear magnetic resonance (NMR) is a primary technique used to elucidate the chemical structure, bonding, stereochemistry, and conformation of organic compounds. In this study, a rapid machine learning protocol was developed to predict H-1 and C-13 chemical shifts through an efficient graph neural network (GNN) using 3D structures as input, improving prediction accuracy. The model demonstrated success in various applications, showcasing its potential in enhancing organic structural assignments using NMR data.
Nuclear magnetic resonance (NMR) is one of the primary techniques used to elucidate the chemical structure, bonding, stereochemistry, and conformation of organic compounds. The distinct chemical shifts in an NMR spectrum depend upon each atom's local chemical environment and are influenced by both through-bond and through-space interactions with other atoms and functional groups. The in silico prediction of NMR chemical shifts using quantum mechanical (QM) calculations is now commonplace in aiding organic structural assignment since spectra can be computed for several candidate structures and then compared with experimental values to find the best possible match. However, the computational demands of calculating multiple structural- and stereo-isomers, each of which may typically exist as an ensemble of rapidly-interconverting conformations, are expensive. Additionally, the QM predictions themselves may lack sufficient accuracy to identify a correct structure. In this work, we address both of these shortcomings by developing a rapid machine learning (ML) protocol to predict H-1 and C-13 chemical shifts through an efficient graph neural network (GNN) using 3D structures as input. Transfer learning with experimental data is used to improve the final prediction accuracy of a model trained using QM calculations. When tested on the CHESHIRE dataset, the proposed model predicts observed C-13 chemical shifts with comparable accuracy to the best-performing DFT functionals (1.5 ppm) in around 1/6000 of the CPU time. An automated prediction webserver and graphical interface are accessible online at http://nova.chem.colostate.edu/cascade/. We further demonstrate the model in three applications: first, we use the model to decide the correct organic structure from candidates through experimental spectra, including complex stereoisomers; second, we automatically detect and revise incorrect chemical shift assignments in a popular NMR database, the NMRShiftDB; and third, we use NMR chemical shifts as descriptors for determination of the sites of electrophilic aromatic substitution.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据