4.8 Article

Retrosynthetic reaction pathway prediction through neural machine translation of atomic environments

Journal

NATURE COMMUNICATIONS
Volume 13, Issue 1, Pages -

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41467-022-28857-w

Keywords

-

Funding

  1. National Research Foundation of Korea (NRF) - Korean government (MSIT) [NRF-2019M3E5D4066898, NRF-2018R1C1B600543513, NRF2020M3A9G7103933]
  2. Korea Environment Industry & Technology Institute (KEITI) through the Technology Development Project for Safety Management of Household Chemical Products - Korea Ministry of Environment (MOE) [KEITI:2020002960002, NTIS:1485017120]
  3. Arontier co.

Ask authors/readers for more resources

The study presents a new approach for retrosynthetic prediction using fragments and the Transformer architecture. By learning the changes in atom environments, the method predicts reactant candidates for chemical reactions, achieving high accuracy in reaction route prediction and discovery.
Reaction route planning remains a major challenge in organic synthesis. The authors present a retrosynthetic prediction model using the fragment-based representation of molecules and the Transformer architecture in neural machine translation. Designing efficient synthetic routes for a target molecule remains a major challenge in organic synthesis. Atom environments are ideal, stand-alone, chemically meaningful building blocks providing a high-resolution molecular representation. Our approach mimics chemical reasoning, and predicts reactant candidates by learning the changes of atom environments associated with the chemical reaction. Through careful inspection of reactant candidates, we demonstrate atom environments as promising descriptors for studying reaction route prediction and discovery. Here, we present a new single-step retrosynthesis prediction method, viz. RetroTRAE, being free from all SMILES-based translation issues, yields a top-1 accuracy of 58.3% on the USPTO test dataset, and top-1 accuracy reaches to 61.6% with the inclusion of highly similar analogs, outperforming other state-of-the-art neural machine translation-based methods. Our methodology introduces a novel scheme for fragmental and topological descriptors to be used as natural inputs for retrosynthetic prediction tasks.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available