☆ 4.4 Article

Statistical variation in progressive scrambling

JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN (2004)

期刊

JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN

卷 18, 期 7-9, 页码 563-576

出版社

KLUWER ACADEMIC PUBL

DOI: 10.1007/s10822-004-4077-z

关键词

cross-validation; PLS; progressive scrambling; redundancy; response randomization

类别

Biochemistry & Molecular Biology Biophysics Computer Science, Interdisciplinary Applications

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

The two methods most often used to evaluate the robustness and predictivity of partial least squares (PLS) models are cross-validation and response randomization. Both methods may be overly optimistic for data sets that contain redundant observations, however. The kinds of perturbation analysis widely used for evaluating model stability in the context of ordinary least squares regression are only applicable when the descriptors are independent of each other and errors are independent and normally distributed; neither assumption holds for QSAR in general and for PLS in particular. Progressive scrambling is a novel, nonparametric approach to perturbing models in the response space in a way that does not disturb the underlying covariance structure of the data. Here, we introduce adjustments for two of the characteristic values produced by a progressive scrambling analysis - the deprecated predictivity (Q(s)(*2)) and standard error of prediction (SDEPs*) - that correct for the effect of introduced perturbation. We also explore the statistical behavior of the adjusted values (Q(0)(*2) and SDEP0*) and the sensitivity to perturbation (dq(2)/dr(yy)(2)). It is shown that the three statistics are all robust for stable PLS models, in terms of the stochastic component of their determination and of their variation due to sampling effects involved in training set selection.

Statistical variation in progressive scrambling

期刊

JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN

出版社

KLUWER ACADEMIC PUBL

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Statistical variation in progressive scrambling

期刊

JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN

出版社

KLUWER ACADEMIC PUBL

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文