4.7 Article

Intrinsic Aqueous Solubility: Mechanistically Transparent Data-Driven Modeling of Drug Substances

期刊

PHARMACEUTICS
卷 14, 期 10, 页码 -

出版社

MDPI
DOI: 10.3390/pharmaceutics14102248

关键词

solubility; drug substances; QSAR; QSPR; fit-for-purpose training set; multiple linear regression; consensus model

资金

  1. Ministry of Education and Research, Republic of Estonia through Estonian Research Council [PRG1509]
  2. European Union European Regional Development Fund through Foundation Archimedes [TK143]

向作者/读者索取更多资源

In this study, authors developed data-driven models for predicting intrinsic aqueous solubility of drug substances using curated training data sets and derived three quantitative structure-property relationships. The models, which are mechanistically transparent and easy to understand, showed significant improvement in prediction capability and reduction of outliers through a consensus modeling approach. The developed models have been published in the QsarDB.org repository according to FAIR principles for unrestricted use in exploration, downloading, and predictions.
Intrinsic aqueous solubility is a foundational property for understanding the chemical, technological, pharmaceutical, and environmental behavior of drug substances. Despite years of solubility research, molecular structure-based prediction of the intrinsic aqueous solubility of drug substances is still under active investigation. This paper describes the authors' systematic data-driven modelling in which two fit-for-purpose training data sets for intrinsic aqueous solubility were collected and curated, and three quantitative structure-property relationships were derived to make predictions for the most recent solubility challenge. All three models perform well individually, while being mechanistically transparent and easy to understand. Molecular descriptors involved in the models are related to the following key steps in the solubility process: dissociation of the molecule from the crystal, formation of a cavity in the solvent, and insertion of the molecule into the solvent. A consensus modeling approach with these models remarkably improved prediction capability and reduced the number of strong outliers by more than two times. The performance and outliers of the second solubility challenge predictions were analyzed retrospectively. All developed models have been published in the QsarDB.org repository according to FAIR principles and can be used without restrictions for exploring, downloading, and making predictions.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据