4.6 Article

Detecting selection using extended haplotype homozygosity (EHH)-based statistics in unphased or unpolarized data

Journal

PLOS ONE
Volume 17, Issue 1, Pages -

Publisher

PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pone.0262024

Keywords

-

Ask authors/readers for more resources

Analysis of population genetic data often involves the search for genomic regions with signs of recent positive selection. In this study, we propose and test modifications to the concept of extended haplotype homozygosity (EHH) and its associated statistics to relax data requirements. Our findings show the indispensability of phased haplotypes for accurate estimation of within-population statistics and cross-population statistics for small samples. Ancestry information, however, is found to be less important for both types of statistics. Our modified statistics are implemented in the publicly available R package rehh.
Analysis of population genetic data often includes a search for genomic regions with signs of recent positive selection. One of such approaches involves the concept of extended haplotype homozygosity (EHH) and its associated statistics. These statistics typically require phased haplotypes, and some of them necessitate polarized variants. Here, we unify and extend previously proposed modifications to loosen these requirements. We compare the modified versions with the original ones by measuring the false discovery rate in simulated whole-genome scans and by quantifying the overlap of inferred candidate regions in empirical data. We find that phasing information is indispensable for accurate estimation of within-population statistics (for all but very large samples) and of cross-population statistics for small samples. Ancestry information, in contrast, is of lesser importance for both types of statistic. Our publicly available R package rehh incorporates the modified statistics presented here.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available