4.7 Article

Inverse QSPR/QSAR Analysis for Chemical Structure Generation (from y to x)

Journal

JOURNAL OF CHEMICAL INFORMATION AND MODELING
Volume 56, Issue 2, Pages 286-299

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/acs.jcim.5b00628

Keywords

-

Funding

  1. Core Research for Evolutionary Science and Technology (CREST) Project 'Development of a knowledge-generating platform driven by big data in drug discovery through production processes' of the Japan Science and Technology Agency (JST)

Ask authors/readers for more resources

Retrieving descriptor information (x information) from a value of an objective variable (y) is a fundamental problem in inverse quantitative structure property relationship (inverse-QSPR) analysis but challenging because of the complexity of the preimage function. Herewith, we propose using a cluster-wise multiple linear regression (cMLR) model as a QSPR model for inverse-QSPR analysis. x information is acquired as a probability density function by combining cMLR and the prior distribution modeled with a mixture of Gaussians (GMMs). Three case studies were conducted to demonstrate various aspects of the potential of cMLR. It was found that the predictive power of cMLR was superior to that of MLR, especially for data with nonlinearity. Moreover, it turned out that the applicability domain could be considered since the posterior distribution inherits the prior distribution's feature (i.e., training data feature) and represents the possibility of having the desired property. Finally, a series of inverse analyses with the GMMs/cMLR was demonstrated with the aim to generate de novo structures having specific aqueous solubility.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available