期刊
STATISTICS IN MEDICINE
卷 40, 期 9, 页码 2239-2256出版社
WILEY
DOI: 10.1002/sim.8900
关键词
contrasted penalization; integrative analysis; partial least squares
类别
资金
- 111 Project [B13028]
- Fundamental Research Funds for the Central Universities [20720181003]
- National Institutes of Health [CA121974, CA196530]
- National Natural Science Foundation of China [11971404, 71988101]
Partial least squares is important for dimension reduction in handling problems with numerous variables. Sparse partial least squares technique helps identify important variables and generate more interpretable results. Integrative analysis gathers raw data from multiple independent datasets and jointly analyzes them to improve performance.
Partial least squares, as a dimension reduction technique, has become increasingly important for its ability to deal with problems with a large number of variables. Since noisy variables may weaken estimation performance, the sparse partial least squares (SPLS) technique has been proposed to identify important variables and generate more interpretable results. However, the small sample size of a single dataset limits the performance of conventional methods. An effective solution comes from gathering information from multiple comparable studies. Integrative analysis has essential importance in multidatasets analysis. The main idea is to improve performance by assembling raw data from multiple independent datasets and analyzing them jointly. In this article, we develop an integrative SPLS (iSPLS) method using penalization based on the SPLS technique. The proposed approach consists of two penalties. The first penalty conducts variable selection under the context of integrative analysis. The second penalty, a contrasted penalty, is imposed to encourage the similarity of estimates across datasets and generate more sensible and accurate results. Computational algorithms are developed. Simulation experiments are conducted to compare iSPLS with alternative approaches. The practical utility of iSPLS is shown in the analysis of two TCGA gene expression data.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据