3.8 Proceedings Paper

Computational Workflow for Accelerated Molecular Design Using Quantum Chemical Simulations and Deep Learning Models

出版社

SPRINGER INTERNATIONAL PUBLISHING AG
DOI: 10.1007/978-3-031-23606-8_1

关键词

-

资金

  1. Office of Science of the Department of Energy
  2. Laboratory Directed Research and Development (LDRD) Program of Oak Ridge National Laboratory
  3. Artificial Intelligence Initiative as part of the Laboratory Directed Research and Development Program of Oak Ridge National Laboratory
  4. US Department of Energy [DE-AC05-00OR22725]
  5. Exascale Computing Project of the U.S. Department of Energy Office of Science [17-SC-20-SC]
  6. Exascale Computing Project of the National Nuclear Security Administration [17-SC-20-SC]
  7. Office of Science of the U.S. Department of Energy [DE-AC05-00OR22725]

向作者/读者索取更多资源

This study presents a workflow for accelerating the design of molecular compounds by combining approximate quantum chemical methods, a neural network surrogate model for chemical property prediction, and a language model for molecule generation. The workflow enables faster searching of chemical space and the generation of optimized molecules, showing potential for a wide range of design problems.
Efficient methods for searching the chemical space of molecular compounds are needed to automate and accelerate the design of new functional molecules such as pharmaceuticals. Given the high cost in both resources and time for experimental efforts, computational approaches play a key role in guiding the selection of promising molecules for further investigation. Here, we construct a workflow to accelerate design by combining approximate quantum chemical methods [i.e. density-functional tight-binding (DFTB)], a graph convolutional neural network (GCNN) surrogate model for chemical property prediction, and a masked language model (MLM) for molecule generation. Property data from the DFTB calculations are used to train the surrogate model; the surrogate model is used to score candidates generated by the MLM. The surrogate reduces computation time by orders of magnitude compared to the DFTB calculations, enabling an increased search of chemical space. Furthermore, the MLM generates a diverse set of chemical modifications based on pre-training from a large compound library. We utilize the workflow to search for near-infrared photoactive molecules by minimizing the predicted HOMO-LUMO gap as the target property. Our results show that the workflow can generate optimized molecules outside of the original training set, which suggests that iterations of the workflow could be useful for searching vast chemical spaces in a wide range of design problems.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据