4.8 Article

Multistep retrosynthesis combining a disconnection aware triple transformer loop with a route penalty score guided tree search

期刊

CHEMICAL SCIENCE
卷 14, 期 36, 页码 9959-9969

出版社

ROYAL SOC CHEMISTRY
DOI: 10.1039/d3sc01604h

关键词

-

向作者/读者索取更多资源

This study introduces a new tool for computer-aided synthesis planning that can automatically learn organic reactivity and perform retrosynthesis. The tool combines a triple transformer loop and a multistep tree search algorithm to propose short synthetic routes based on commercial starting materials.
Computer-aided synthesis planning (CASP) aims to automatically learn organic reactivity from literature and perform retrosynthesis of unseen molecules. CASP systems must learn reactions sufficiently precisely to propose realistic disconnections, while avoiding overfitting to leave room for diverse options, and explore possible routes such as to allow short synthetic sequences to emerge. Herein we report an open-source CASP tool proposing original solutions to both challenges. First, we use a triple transformer loop (TTL) predicting starting materials (T1), reagents (T2), and products (T3) to explore various disconnection sites defined by combining systematic, template-based, and transformer-based tagging procedures. Second, we integrate TTL into a multistep tree search algorithm (TTLA) prioritizing sequences using a route penalty score (RPScore) considering the number of steps, their confidence score, and the simplicity of all intermediates along the route. Our approach favours short synthetic routes to commercial starting materials, as exemplified by retrosynthetic analyses of recently approved drugs. An efficient transformer-based retrosynthesis model, the triple-transformer loop algorithm (TTLA), is reported and proposes short routes from commercial building blocks for a variety of drugs.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据