☆ 4.7 Article

Sparse partial least-squares regression and its applications to high-throughput data analysis

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS (2011)

期刊

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS

卷 109, 期 1, 页码 1-8

出版社

ELSEVIER

DOI: 10.1016/j.chemolab.2011.07.002

关键词

Lasso; Modeling; Prediction; Regression analyses; Variable selection

类别

Automation & Control Systems Chemistry, Analytical Computer Science, Artificial Intelligence Instruments & Instrumentation Mathematics, Interdisciplinary Applications Statistics & Probability

资金

Swedish Research Council
National Research Foundation of Korea(NRF)
Ministry of Education, Science and Technology [2010-0011372]
National Research Foundation of Korea [2010-0011372] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

The partial least-squares (PLS) method is designed for prediction problems where the number of predictors is larger than the number of training samples. PIS is based on latent components that are linear combinations of all of the original predictors, so it automatically employs all predictors regardless of their relevance. This will potentially compromise its performance, but it will also make it difficult to interpret the result. In this paper, we propose a new formulation of the sparse PIS (SPLS) procedure to allow both sparse variable selection and dimension reduction. We use the standard L-1-penalty and the unbounded penalty of [1]. We develop a computing algorithm for SPLS by modifying the nonlinear iterative partial least-squares (NIPALS) algorithm, and illustrate the method with an analysis of a cancer dataset. Through the numerical studies we find that our SPLS method generally performs better than the standard PIS and other existing methods in variable selection and prediction. (C) 2011 Elsevier B.V. All rights reserved.

Sparse partial least-squares regression and its applications to high-throughput data analysis

期刊

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Sparse partial least-squares regression and its applications to high-throughput data analysis

期刊

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文