4.3 Article

Utilizing Graph Theory to Select the Largest Set of Unrelated Individuals for Genetic Analysis

期刊

GENETIC EPIDEMIOLOGY
卷 37, 期 2, 页码 136-141

出版社

WILEY
DOI: 10.1002/gepi.21684

关键词

genome-wide association study; Bron-Kerbosch; cryptic relatedness; bioinformatics; sample selection

资金

  1. NHGRI [T32 HG00035, HG006493]
  2. NHLBI [HL102926]

向作者/读者索取更多资源

Many statistical analyses of genetic data rely on the assumption of independence among samples. Consequently, relatedness is either modeled in the analysis or samples are removed to clean the data of any pairwise relatedness above a tolerated threshold. Current methods do not maximize the number of unrelated individuals retained for further analysis, and this is a needless loss of resources. We report a novel application of graph theory that identifies the maximum set of unrelated samples in any dataset given a user-defined threshold of relatedness as well as all networks of related samples. We have implemented this method into an open source program called Pedigree Reconstruction and Identification of a Maximum Unrelated Set, PRIMUS. We show that PRIMUS outperforms the three existing methods, allowing researchers to retain up to 50% more unrelated samples. A unique strength of PRIMUS is its ability to weight the maximum clique selection using additional criteria (e.g. affected status and data missingness). PRIMUS is a permanent solution to identifying the maximum number of unrelated samples for a genetic analysis.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据