3.8 Article

Criteria of GenCall score to edit marker data and methods to handle missing markers have an influence on accuracy of genomic predictions

Journal

ARCHIV FUR TIERZUCHT-ARCHIVES OF ANIMAL BREEDING
Volume 56, Issue -, Pages 778-788

Publisher

ARCHIV FUR TIERZUCHT
DOI: 10.7482/0003-9438-56-077

Keywords

GenCall scores; missing genotype; imputation; genomic prediction

Funding

  1. project Genomic Selection - From function to efficient utilisation in cattle breeding [3405-10-0137]
  2. Danish Directorate for Food, Fisheries and Agri Business (Copenhagen, Denmark)
  3. Milk Levy Fund (Aarhus, Denmark)
  4. VikingGenetics (Randers, Denmark)
  5. Nordic Cattle Genetic Evaluation (Aarhus, Denmark)
  6. Aarhus University (Aarhus, Denmark)

Ask authors/readers for more resources

The aim of this study was to investigate the effect of different strategies for handling low-quality or missing data on prediction accuracy for direct genomic values of protein yield, mastitis and fertility using a Bayesian variable model and a GBLUP model in the Danish Jersey population. The data contained 1 071 Jersey bulls that were genotyped with the Illumina Bovine 50K chip. After preliminary editing, 39 227 single nucleotide polymorphism (SNPs) remained in the dataset. Four methods to handle missing genotypes were: 1) BEAGLE: missing markers were imputed using Beagle 3.3 software, 2) COMMON: missing genotypes at a locus were replaced by the most common genotype at this locus observed in the marker data, 3) EX-ALLELE: missing marker genotypes at a locus were treated as an extra allele, and 4) POP-EXP: missing genotypes at a locus were replaced with population expectation at this locus. It was shown that among the methods used in this study, the imputation with Beagle was the best approach to handle missing genotypes. Treating missing markers as a pseudo-allele, replacing missing markers with a population average or substituting the most common alleles each reduced the accuracy of genomic predictions. The results from this study suggest that missing genotypes should be imputed in order to improve genomic prediction. Editing the marker data with a stringent threshold on GenCall scores and then imputing the discarded genotypes did not lead to higher accuracy. All marker genotypes with a GenCall score over 0.15 should be retained for genomic prediction.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

3.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available