4.6 Article

The application of naive Bayes model averaging to predict Alzheimer's disease from genome-wide data

出版社

OXFORD UNIV PRESS
DOI: 10.1136/amiajnl-2011-000101

关键词

-

资金

  1. NLM [R01-LM010020, HHSN276201000030C]
  2. NSF [IIS-0911032]
  3. Direct For Computer & Info Scie & Enginr
  4. Div Of Information & Intelligent Systems [911032] Funding Source: National Science Foundation

向作者/读者索取更多资源

Objective Predicting patient outcomes from genome-wide measurements holds significant promise for improving clinical care. The large number of measurements (eg, single nucleotide polymorphisms (SNPs)), however, makes this task computationally challenging. This paper evaluates the performance of an algorithm that predicts patient outcomes from genome-wide data by efficiently model averaging over an exponential number of naive Bayes (NB) models. Design This model-averaged naive Bayes (MANB) method was applied to predict late onset Alzheimer's disease in 1411 individuals who each had 312 318 SNP measurements available as genome-wide predictive features. Its performance was compared to that of a naive Bayes algorithm without feature selection (NB) and with feature selection (FSNB). Measurement Performance of each algorithm was measured in terms of area under the ROC curve (AUC), calibration, and run time. Results The training time of MANB (16.1 s) was fast like NB (15.6 s), while FSNB (1684.2 s) was considerably slower. Each of the three algorithms required less than 0.1 s to predict the outcome of a test case. MANB had an AUG of 0.72, which is significantly better than the AUG of 0.59 by NB (p<0.00001), but not significantly different from the AUG of 0.71 by FSNB. MANB was better calibrated than NB, and FSNB was even better in calibration. A limitation was that only one dataset and two comparison algorithms were included in this study. Conclusion MANB performed comparatively well in predicting a clinical outcome from a high-dimensional genome-wide dataset. These results provide support for including MANB in the methods used to predict outcomes from large, genome-wide datasets.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据