4.6 Article

Extensive simulations assess the performance of genome-wide association mapping in various Saccharomyces cerevisiae subpopulations

Publisher

ROYAL SOC
DOI: 10.1098/rstb.2020.0514

Keywords

genome-wide association study; population genomics; complex trait; variant mapping; yeast

Categories

Funding

  1. European Research Council (ERC) [772505]
  2. European Research Council (ERC) [772505] Funding Source: European Research Council (ERC)

Ask authors/readers for more resources

With the development of high throughput sequencing technologies, genome-wide association studies (GWAS) have become a powerful tool for analyzing the genetic origins of phenotypic variation. This study evaluated the feasibility and limitations of GWAS by sequencing the genomes of 1011 Saccharomyces cerevisiae isolates. The results showed that population structure is not a major issue when the sample size is large enough. While the genetic determinant of Mendelian traits is easily mapped, there are discrepancies in GWAS for complex traits in terms of detection, false positive, and false negative rates.
With the advent of high throughput sequencing technologies, genome-wide association studies (GWAS) have become a powerful paradigm for dissecting the genetic origins of the observed phenotypic variation. We recently completely sequenced the genome of 1011 Saccharomyces cerevisiae isolates, laying a strong foundation for GWAS. To assess the feasibility and the limits of this approach, we performed extensive simulations using five selected subpopulations as well as the total set of 1011 genomes. We measured the ability to detect the causal genetic variants involved in Mendelian and more complex traits using a linear mixed model approach. The results showed that population structure is well accounted for and is not the main problem when the sample size is high enough. While the genetic determinant of a Mendelian trait is easily mapped in all studied subpopulations, discrepancies are seen between datasets when performing GWAS on a complex trait in terms of detection, false positive and false negative rate. Finally, we performed GWAS on the different defined subpopulations using a real quantitative trait (resistance to copper sulfate) and showed the feasibility of this approach. The performance of each dataset depends simultaneously on several factors such as sample size, relatedness and population evolutionary history.This article is part of the theme issue 'Genetic basis of adaptation and speciation: from loci to causative mutations'.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available