☆ 4.4 Review Book Chapter

Population Identification Using Genetic Data

ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS, VOL 13 (2012)

期刊

ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS, VOL 13

卷 13, 期 -, 页码 337-361

出版社

ANNUAL REVIEWS

DOI: 10.1146/annurev-genom-082410-101510

关键词

population structure; similarity measure; genetic distance; haplotypes; PCA; principal components

类别

Genetics & Heredity

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

A large number of algorithms have been developed to classify individuals into discrete populations using genetic data. Recent results show that the information used by both model-based clustering methods and principal components analysis can be summarized by a matrix of pairwise similarity measures between individuals. Similarity matrices have been constructed in a number of ways, usually treating markers as independent but differing in the weighting given to polymorphisms of different frequencies. Additionally, methods are now being developed that take linkage into account. We review several such matrices and evaluate their information content. A two-stage approach for population identification is to first construct a similarity matrix and then perform clustering. We review a range of common clustering algorithms and evaluate their performance through a simulation study. The clustering step can be performed either on the matrix or by first using a dimension-reduction technique; we find that the latter approach substantially improves the performance of most algorithms. Based on these results, we describe the population structure signal contained in each similarity matrix and find that accounting for linkage leads to significant improvements for sequence data. We also perform a comparison on real data, where we find that population genetics models outperform generic clustering approaches, particularly with regard to robustness for features such as relatedness between individuals.

Population Identification Using Genetic Data

期刊

ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS, VOL 13

出版社

ANNUAL REVIEWS

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Population Identification Using Genetic Data

期刊

ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS, VOL 13

出版社

ANNUAL REVIEWS

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文