4.4 Article

Optimal Piecewise Linear Regression Algorithm for QSAR Modelling

期刊

MOLECULAR INFORMATICS
卷 38, 期 3, 页码 -

出版社

WILEY-V C H VERLAG GMBH
DOI: 10.1002/minf.201800028

关键词

qsar; regression; piecewise regression; mathematical programming; integer programming

资金

  1. CAPES, Brazil [13312138]
  2. Leverhulme Trust [RPG-2015-240]
  3. UK Engineering & Physical Sciences Research Council [EP/I033270/1, EP/M027856/1]
  4. EPSRC [EP/I033270/1, EP/M027856/1] Funding Source: UKRI

向作者/读者索取更多资源

Quantitative Structure-Activity Relationship (QSAR) models have been successfully applied to lead optimisation, virtual screening and other areas of drug discovery over the years. Recent studies, however, have focused on the development of models that are predictive but often not interpretable. In this article, we propose the application of a piecewise linear regression algorithm, OPLRAreg, to develop both predictive and interpretable QSAR models. The algorithm determines a feature to best separate the data into regions and identifies linear equations to predict the outcome variable in each region. A regularisation term is introduced to prevent overfitting problems and implicitly selects the most informative features. As OPLRAreg is based on mathematical programming, a flexible and transparent representation for optimisation problems, the algorithm also permits customised constraints to be easily added to the model. The proposed algorithm is presented as a more interpretable alternative to other commonly used machine learning algorithms and has shown comparable predictive accuracy to Random Forest, Support Vector Machine and Random Generalised Linear Model on tests with five QSAR data sets compiled from the ChEMBL database.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据