4.5 Article

Integrative sparse partial least squares

期刊

STATISTICS IN MEDICINE
卷 40, 期 9, 页码 2239-2256

出版社

WILEY
DOI: 10.1002/sim.8900

关键词

contrasted penalization; integrative analysis; partial least squares

资金

  1. 111 Project [B13028]
  2. Fundamental Research Funds for the Central Universities [20720181003]
  3. National Institutes of Health [CA121974, CA196530]
  4. National Natural Science Foundation of China [11971404, 71988101]

向作者/读者索取更多资源

Partial least squares is important for dimension reduction in handling problems with numerous variables. Sparse partial least squares technique helps identify important variables and generate more interpretable results. Integrative analysis gathers raw data from multiple independent datasets and jointly analyzes them to improve performance.
Partial least squares, as a dimension reduction technique, has become increasingly important for its ability to deal with problems with a large number of variables. Since noisy variables may weaken estimation performance, the sparse partial least squares (SPLS) technique has been proposed to identify important variables and generate more interpretable results. However, the small sample size of a single dataset limits the performance of conventional methods. An effective solution comes from gathering information from multiple comparable studies. Integrative analysis has essential importance in multidatasets analysis. The main idea is to improve performance by assembling raw data from multiple independent datasets and analyzing them jointly. In this article, we develop an integrative SPLS (iSPLS) method using penalization based on the SPLS technique. The proposed approach consists of two penalties. The first penalty conducts variable selection under the context of integrative analysis. The second penalty, a contrasted penalty, is imposed to encourage the similarity of estimates across datasets and generate more sensible and accurate results. Computational algorithms are developed. Simulation experiments are conducted to compare iSPLS with alternative approaches. The practical utility of iSPLS is shown in the analysis of two TCGA gene expression data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据