4.4 Article

A SPECTRAL GRAPH APPROACH TO DISCOVERING GENETIC ANCESTRY

期刊

ANNALS OF APPLIED STATISTICS
卷 4, 期 1, 页码 179-202

出版社

INST MATHEMATICAL STATISTICS
DOI: 10.1214/09-AOAS281

关键词

Human genetics; dimension reduction; multidimensional scaling; population structure; spectral embedding

资金

  1. NIH [MH057881]
  2. ONR [N0014-08-1-0673]

向作者/读者索取更多资源

Mapping human genetic variation is fundamentally interesting in fields such as anthropology and forensic inference. At the same time, patterns of genetic diversity confound efforts to determine the genetic basis of complex disease. Due to technological advances, it is now possible to measure hundreds of thousands of genetic variants per individual across the genome. Principal component analysis (PCA) is routinely used to summarize the genetic similarity between subjects. The eigenvectors are interpreted as dimensions of ancestry. We build on this idea using a spectral graph approach. In the process we draw on connections between multidimensional scaling and spectral kernel methods. Our approach, based on a spectral embedding derived from the normalized Laplacian of a graph, can produce more meaningful delineation of ancestry than by using PCA. The method is stable to outliers and can more easily incorporate different similarity measures of genetic data than PCA. We illustrate a new algorithm for genetic clustering and association analysis on a large, genetically heterogeneous sample.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据