3.8 Article

De novo generated combinatorial library design

Journal

DIGITAL DISCOVERY
Volume -, Issue -, Pages -

Publisher

ROYAL SOC CHEMISTRY
DOI: 10.1039/d3dd00095h

Keywords

-

Ask authors/readers for more resources

Artificial intelligence (AI) offers new approaches to design compounds in drug discovery, such as suggesting new molecular structures or optimizing existing leads. However, the lack of high-quality data sets limits the effectiveness of AI methods. This study proposes a framework for designing combinatorial libraries using a molecular generative model and optimization algorithms. Simulation experiments show that near-optimal library designs can be achieved even without synthesizing additional building blocks.
Artificial intelligence (AI) contributes new methods for designing compounds in drug discovery, ranging from de novo design models suggesting new molecular structures or optimizing existing leads to predictive models evaluating their toxicological properties. However, a limiting factor for the effectiveness of AI methods in drug discovery is the lack of access to high-quality data sets leading to a focus on approaches optimizing data generation. Combinatorial library design is a popular approach for bioactivity testing as a large number of molecules can be synthesized from a limited number of building blocks. We propose a framework for designing combinatorial libraries using a molecular generative model to generate building blocks de novo, followed by using k-determinantal point processes and Gibbs sampling to optimize a selection from the generated blocks. We explore optimization of biological activity, Quantitative Estimate of Drug-likeness (QED) and diversity and the trade-offs between them, both in single-objective and in multi-objective library design settings. Using retrosynthesis models to estimate building block availability, the proposed framework is able to explore the prospective benefit from expanding a stock of available building blocks by synthesis or by purchasing the preferred building blocks before designing a library. In simulation experiments with building block collections from all available commercial vendors near-optimal libraries could be found without synthesis of additional building blocks; in other simulation experiments we showed that even one synthesis step to increase the number of available building blocks could improve library designs when starting with an in-house building block collection of reasonable size.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

3.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available