4.5 Article

Accuracy of genomic prediction using imputed whole-genome sequence data in white layers

Journal

JOURNAL OF ANIMAL BREEDING AND GENETICS
Volume 133, Issue 3, Pages 167-179

Publisher

WILEY
DOI: 10.1111/jbg.12199

Keywords

Genomic prediction accuracy; whole-genome sequence; causal mutations; imputation; biological information

Funding

  1. Agriculture and Food Research Initiative from the USDA National Institute of Food and Agriculture [2009-65205-05665]
  2. Wageningen University
  3. Hendrix Genetics, the Netherlands
  4. Dutch Ministry of Economic Affairs, Agriculture, and Innovation [KB-12-006.03-005-ASG-LR]
  5. NIFA [581820, 2009-65205-05665] Funding Source: Federal RePORTER

Ask authors/readers for more resources

There is an increasing interest in using whole-genome sequence data in genomic selection breeding programmes. Prediction of breeding values is expected to be more accurate when whole-genome sequence is used, because the causal mutations are assumed to be in the data. We performed genomic prediction for the number of eggs in white layers using imputed whole-genome resequence data including similar to 4.6 million SNPs. The prediction accuracies based on sequence data were compared with the accuracies from the 60 K SNP panel. Predictions were based on genomic best linear unbiased prediction (GBLUP) as well as a Bayesian variable selection model (BayesC). Moreover, the prediction accuracy from using different types of variants (synonymous, non-synonymous and non-coding SNPs) was evaluated. Genomic prediction using the 60 K SNP panel resulted in a prediction accuracy of 0.74 when GBLUP was applied. With sequence data, there was a small increase (similar to 1%) in prediction accuracy over the 60 K genotypes. With both 60 K SNP panel and sequence data, GBLUP slightly outperformed BayesC in predicting the breeding values. Selection of SNPs more likely to affect the phenotype (i.e. non-synonymous SNPs) did not improve the accuracy of genomic prediction. The fact that sequence data were based on imputation from a small number of sequenced animals may have limited the potential to improve the prediction accuracy. A small reference population (n = 1004) and possible exclusion of many causal SNPs during quality control can be other possible reasons for limited benefit of sequence data. We expect, however, that the limited improvement is because the 60 K SNP panel was already sufficiently dense to accurately determine the relationships between animals in our data.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available