4.7 Article

Estimating and interpreting FST: The impact of rare variants

Journal

GENOME RESEARCH
Volume 23, Issue 9, Pages 1514-1521

Publisher

COLD SPRING HARBOR LAB PRESS, PUBLICATIONS DEPT
DOI: 10.1101/gr.154831.113

Keywords

-

Funding

  1. NIH [T32 HG002295, R01 HG006399, R03 HG006170]

Ask authors/readers for more resources

In a pair of seminal papers, Sewall Wright and Gustave Malecot introduced F-ST as a measure of structure in natural populations. In the decades that followed, a number of papers provided differing definitions, estimation methods, and interpretations beyond Wright's. While this diversity in methods has enabled many studies in genetics, it has also introduced confusion regarding how to estimate F-ST from available data. Considering this confusion, wide variation in published estimates of F-ST for pairs of HapMap populations is a cause for concern. These estimates changed-in some cases more than twofold-when comparing estimates from genotyping arrays to those from sequence data. Indeed, changes in F-ST from sequencing data might be expected due to population genetic factors affecting rare variants. While rare variants do influence the result, we show that this is largely through differences in estimation methods. Correcting for this yields estimates of F-ST that are much more concordant between sequence and genotype data. These differences relate to three specific issues: (1) estimating F-ST for a single SNP, (2) combining estimates of F-ST across multiple SNPs, and (3) selecting the set of SNPs used in the computation. Changes in each of these aspects of estimation may result in F-ST estimates that are highly divergent from one another. Here, we clarify these issues and propose solutions.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available