期刊
PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'22)
卷 -, 期 -, 页码 1156-1163出版社
ASSOC COMPUTING MACHINERY
DOI: 10.1145/3512290.3528824
关键词
genetic algorithm; molecular representation; molecule design
资金
- German Research Foundation (DFG) [GRK 1765/2]
This study presents a genetic algorithm-based approach for reconstructing molecules from their fingerprint representation and introduces a novel Transformer neural language model trained on molecular fingerprints for molecule generation.
For in silico drug discovery various representations have been established regarding storing and processing molecular data. The choice of representation has a great impact on employed methods and algorithms. Molecular fingerprints in the form of fixed-size bit vectors are a widely used representation which captures structural features of a molecule and enables a straight-forward way of estimating molecule similarities. However, since fingerprints are not invertible, they are rarely utilized for molecule generation tasks. This study presents an approach to the reconstruction of molecules from their fingerprint representation that is based on genetic algorithms. The algorithm assembles molecules from BRICS fragments and therefore only generates valid molecular structures. We demonstrate that the genetic algorithm is able to construct molecules similar to the specified target, or even reconstruct the original molecule. Furthermore, to illustrate how this genetic algorithm unlocks fingerprints as a representation for other in silico drug discovery methods, a novel Transformer neural language model trained on molecular fingerprints is introduced as a molecule generation model.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据