4.5 Article

Kernel Machine Approach to Testing the Significance of Multiple Genetic Markers for Risk Prediction

期刊

BIOMETRICS
卷 67, 期 3, 页码 975-986

出版社

WILEY
DOI: 10.1111/j.1541-0420.2010.01544.x

关键词

Gene-set analysis; Genetic association; Genetic pathways; Kernel machine; Kernel PCA; Risk prediction; Score test; Survival analysis

资金

  1. National Institute of General Medical Sciences [R01-GM079330-03]
  2. National Science Foundation [DMS 0854970]
  3. National Cancer Institute [R37-CA076404, P01-CA134294]
  4. Direct For Mathematical & Physical Scien
  5. Division Of Mathematical Sciences [0854970] Funding Source: National Science Foundation

向作者/读者索取更多资源

There is growing evidence that genomic and proteomic research holds great potential for changing irrevocably the practice of medicine. The ability to identify important genomic and biological markers for risk assessment can have a great impact in public health from disease prevention, to detection, to treatment selection. However, the potentially large number of markers and the complexity in the relationship between the markers and the outcome of interest impose a grand challenge in developing accurate risk prediction models. The standard approach to identifying important markers often assesses the marginal effects of individual markers on a phenotype of interest. When multiple markers relate to the phenotype simultaneously via a complex structure, such a type of marginal analysis may not be effective. To overcome such difficulties, we employ a kernel machine Cox regression framework and propose an efficient score test to assess the overall effect of a set of markers, such as genes within a pathway or a network, on survival outcomes. The proposed test has the advantage of capturing the potentially nonlinear effects without explicitly specifying a particular nonlinear functional form. To approximate the null distribution of the score statistic, we propose a simple resampling procedure that can be easily implemented in practice. Numerical studies suggest that the test performs well with respect to both empirical size and power even when the number of variables in a gene set is not small compared to the sample size.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据