4.7 Article

Predicting Potent Compounds Using a Conditional Variational Autoencoder Based upon a New Structure-Potency Fingerprint

Journal

BIOMOLECULES
Volume 13, Issue 2, Pages -

Publisher

MDPI
DOI: 10.3390/biom13020393

Keywords

bioactive compounds; potency prediction; fingerprints; machine learning; conditional variational autoencoder

Ask authors/readers for more resources

Prediction of potency of bioactive compounds is typically done using linear or nonlinear quantitative structure-activity relationship (QSAR) models. In this study, a novel approach called structure-potency fingerprint (SPFP) is introduced, which combines structural features and potency values into a single bit string representation. By using a conditional variational autoencoder (CVAE) with SPFPs, the potency module of test compounds can be accurately predicted using only their structure module. The SPFP-CVAE approach achieves comparable accuracy to support vector regression (SVR) and deep neural networks in predicting compounds' activity classes and potency values.
Prediction of the potency of bioactive compounds generally relies on linear or nonlinear quantitative structure-activity relationship (QSAR) models. Nonlinear models are generated using machine learning methods. We introduce a novel approach for potency prediction that depends on a newly designed molecular fingerprint (FP) representation. This structure-potency fingerprint (SPFP) combines different modules accounting for the structural features of active compounds and their potency values in a single bit string, hence unifying structure and potency representation. This encoding enables the derivation of a conditional variational autoencoder (CVAE) using SPFPs of training compounds and apply the model to predict the SPFP potency module of test compounds using only their structure module as input. The SPFP-CVAE approach correctly predicts the potency values of compounds belonging to different activity classes with an accuracy comparable to support vector regression (SVR), representing the state-of-the-art in the field. In addition, highly potent compounds are predicted with very similar accuracy as SVR and deep neural networks.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available