☆ 4.0 Article

Tunable structure priors for Bayesian rule learning for knowledge integrated biomarker discovery

WORLD JOURNAL OF CLINICAL ONCOLOGY (2018)

期刊

WORLD JOURNAL OF CLINICAL ONCOLOGY

卷 9, 期 5, 页码 98-109

出版社

BAISHIDENG PUBLISHING GROUP INC

DOI: 10.5306/wjco.v9.i5.98

关键词

Supervised machine learning; Rule-based models; Bayesian methods; Background knowledge; Informative priors; Biomarker discovery

类别

Oncology

资金

National Institute of General Medical Sciences of the National Institutes of Health [R01GM100387]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

AIM To develop a framework to incorporate background domain knowledge into classification rule learning for knowledge discovery in biomedicine. METHODS Bayesian rule learning (BRL) is a rule-based classifier that uses a greedy best-first search over a space of bayesian belief-networks (BN) to find the optimal BN to explain the input dataset, and then infers classification rules from this BN. BRL uses a Bayesian score to evaluate the quality of BNs. In this paper, we extended the Bayesian score to include informative structure priors, which encodes our prior domain knowledge about the dataset. We call this extension of BRL as BRLp. The structure prior has a lambda hyperparameter that allows the user to tune the degree of incorporation of the prior knowledge in the model learning process. We studied the effect of lambda on model learning using a simulated dataset and a real-world lung cancer prognostic biomarker dataset, by measuring the degree of incorporation of our specified prior knowledge. We also monitored its effect on the model predictive performance. Finally, we compared BRLp to other state-of-the-art classifiers commonly used in biomedicine. RESULTS We evaluated the degree of incorporation of prior knowledge into BRLp, with simulated data by measuring the Graph Edit Distance between the true data-generating model and the model learned by BRLp. We specified the true model using informative structure priors. We observed that by increasing the value of lambda we were able to increase the influence of the specified structure priors on model learning. A large value of lambda of BRLp caused it to return the true model. This also led to a gain in predictive performance measured by area under the receiver operator characteristic curve (AUC). We then obtained a publicly available real-world lung cancer prognostic biomarker dataset and specified a known biomarker from literature [the epidermal growth factor receptor (EGFR) gene]. We again observed that larger values of lambda led to an increased incorporation of EGFR into the final BRLp model. This relevant background knowledge also led to a gain in AUC. CONCLUSION BRLp enables tunable structure priors to be incorporated during Bayesian classification rule learning that integrates data and knowledge as demonstrated using lung cancer biomarker data.

Tunable structure priors for Bayesian rule learning for knowledge integrated biomarker discovery

期刊

WORLD JOURNAL OF CLINICAL ONCOLOGY

出版社

BAISHIDENG PUBLISHING GROUP INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Tunable structure priors for Bayesian rule learning for knowledge integrated biomarker discovery

期刊

WORLD JOURNAL OF CLINICAL ONCOLOGY

出版社

BAISHIDENG PUBLISHING GROUP INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文