4.7 Article

Intrinsic Aqueous Solubility: Mechanistically Transparent Data-Driven Modeling of Drug Substances

Journal

PHARMACEUTICS
Volume 14, Issue 10, Pages -

Publisher

MDPI
DOI: 10.3390/pharmaceutics14102248

Keywords

solubility; drug substances; QSAR; QSPR; fit-for-purpose training set; multiple linear regression; consensus model

Funding

  1. Ministry of Education and Research, Republic of Estonia through Estonian Research Council [PRG1509]
  2. European Union European Regional Development Fund through Foundation Archimedes [TK143]

Ask authors/readers for more resources

In this study, authors developed data-driven models for predicting intrinsic aqueous solubility of drug substances using curated training data sets and derived three quantitative structure-property relationships. The models, which are mechanistically transparent and easy to understand, showed significant improvement in prediction capability and reduction of outliers through a consensus modeling approach. The developed models have been published in the QsarDB.org repository according to FAIR principles for unrestricted use in exploration, downloading, and predictions.
Intrinsic aqueous solubility is a foundational property for understanding the chemical, technological, pharmaceutical, and environmental behavior of drug substances. Despite years of solubility research, molecular structure-based prediction of the intrinsic aqueous solubility of drug substances is still under active investigation. This paper describes the authors' systematic data-driven modelling in which two fit-for-purpose training data sets for intrinsic aqueous solubility were collected and curated, and three quantitative structure-property relationships were derived to make predictions for the most recent solubility challenge. All three models perform well individually, while being mechanistically transparent and easy to understand. Molecular descriptors involved in the models are related to the following key steps in the solubility process: dissociation of the molecule from the crystal, formation of a cavity in the solvent, and insertion of the molecule into the solvent. A consensus modeling approach with these models remarkably improved prediction capability and reduced the number of strong outliers by more than two times. The performance and outliers of the second solubility challenge predictions were analyzed retrospectively. All developed models have been published in the QsarDB.org repository according to FAIR principles and can be used without restrictions for exploring, downloading, and making predictions.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available