期刊
BIOINFORMATICS
卷 38, 期 16, 页码 3918-3926出版社
OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btac416
关键词
-
类别
资金
- EPSRC
This paper proposes an interpretable and scalable Bayesian proportional hazards model, referred to as sparse variational Bayes, for analyzing high-dimensional sparse survival data. The proposed method overcomes the high computational cost of traditional methods and offers a mechanism for variable selection via posterior inclusion probabilities. Extensive simulations demonstrate the comparable or better performance of the proposed method compared to state-of-the-art Bayesian variable selection methods.
Motivation: Few Bayesian methods for analyzing high-dimensional sparse survival data provide scalable variable selection, effect estimation and uncertainty quantification. Such methods often either sacrifice uncertainty quantification by computing maximum a posteriori estimates, or quantify the uncertainty at high (unscalable) computational expense. Results: We bridge this gap and develop an interpretable and scalable Bayesian proportional hazards model for prediction and variable selection, referred to as sparse variational Bayes. Our method, based on a mean-field variational approximation, overcomes the high computational cost of Markov chain Monte Carlo, whilst retaining useful features, providing a posterior distribution for the parameters and offering a natural mechanism for variable selection via posterior inclusion probabilities. The performance of our proposed method is assessed via extensive simulations and compared against other state-of-the-art Bayesian variable selection methods, demonstrating comparable or better performance. Finally, we demonstrate how the proposed method can be used for variable selection on two transcriptomic datasets with censored survival outcomes, and how the uncertainty quantification offered by our method can be used to provide an interpretable assessment of patient risk.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据