4.7 Article

Group spike-and-slab lasso generalized linear models for disease prediction and associated genes detection by incorporating pathway information

期刊

BIOINFORMATICS
卷 34, 期 6, 页码 901-910

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btx684

关键词

-

资金

  1. USA National Institutes of Health [R03-DE024198, R03-DE025646]
  2. National Science Foundation [IIS-1502172]
  3. China Scholarship Council
  4. National Natural Science Foundation of China [81573253, 81773541, 81673448]
  5. Priority Academic Program Development of Jiangsu Higher Education Institutions at Soochow University, Natural Science Foundation of Jiangsu Province China [BK 20161218]
  6. Direct For Computer & Info Scie & Enginr
  7. Div Of Information & Intelligent Systems [1502273, 1502172] Funding Source: National Science Foundation
  8. Division Of Graduate Education
  9. Direct For Education and Human Resources [1523115] Funding Source: National Science Foundation

向作者/读者索取更多资源

Motivation: Large-scale molecular data have been increasingly used as an important resource for prognostic prediction of diseases and detection of associated genes. However, standard approaches for omics data analysis ignore the group structure among genes encoded in functional relationships or pathway information. Results: We propose new Bayesian hierarchical generalized linear models, called group spike-and-slab lasso GLMs, for predicting disease outcomes and detecting associated genes by incorporating large-scale molecular data and group structures. The proposed model employs a mixture double-exponential prior for coefficients that induces self-adaptive shrinkage amount on different coefficients. The group information is incorporated into the model by setting group-specific parameters. We have developed a fast and stable deterministic algorithm to fit the proposed hierarchal GLMs, which can perform variable selection within groups. We assess the performance of the proposed method on several simulated scenarios, by varying the overlap among groups, group size, number of non-null groups, and the correlation within group. Compared with existing methods, the proposed method provides not only more accurate estimates of the parameters but also better prediction. We further demonstrate the application of the proposed procedure on three cancer datasets by utilizing pathway structures of genes. Our results show that the proposed method generates powerful models for predicting disease outcomes and detecting associated genes.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据