Journal
PLOS GENETICS
Volume 2, Issue 12, Pages 2074-2093Publisher
PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pgen.0020190
Keywords
-
Categories
Funding
- NHGRI NIH HHS [R01 HG006399] Funding Source: Medline
- Wellcome Trust Funding Source: Medline
Ask authors/readers for more resources
Current methods for inferring population structure from genetic data do not provide formal significance tests for population differentiation. We discuss an approach to studying population structure ( principal components analysis) that was first applied to genetic data by Cavalli-Sforza and colleagues. We place the method on a solid statistical footing, using results from modern statistics to develop formal significance tests. We also uncover a general phase change'' phenomenon about the ability to detect structure in genetic data, which emerges from the statistical theory we use, and has an important implication for the ability to discover structure in genetic data: for a fixed but large dataset size, divergence between two populations (as measured, for example, by a statistic like F-ST) below a threshold is essentially undetectable, but a little above threshold, detection will be easy. This means that we can predict the dataset size needed to detect structure.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available