4.3 Article

Genomic Prediction Within and Across Biparental Families: Means and Variances of Prediction Accuracy and Usefulness of Deterministic Equations

期刊

G3-GENES GENOMES GENETICS
卷 7, 期 11, 页码 3571-3586

出版社

OXFORD UNIV PRESS INC
DOI: 10.1534/g3.117.300076

关键词

genomic prediction; biparental families; plant breeding; GBLUP deterministic accuracy; linkage disequilibrium; GenPred; Shared Data Resources; Genomic Selection

资金

  1. Syngenta
  2. International Maize and Wheat Improvement Center/Gesellschaft fur Internationale Zusammenarbeit (CIMMYT/GIZ) through the Climate Resilient Maize for Asia (CRMA) [15.78600.8-001-00]

向作者/读者索取更多资源

A major application of genomic prediction (GP) in plant breeding is the identification of superior inbred lines within families derived from biparental crosses. When models for various traits were trained within related or unrelated biparental families (BPFs), experimental studies found substantial variation in prediction accuracy (PA), but little is known about the underlying factors. We used SNP marker genotypes of inbred lines from either elite germplasm or landraces of maize (Zea mays L.) as parents to generate in silico 300 BPFs of doubled-haploid lines. We analyzed PA within each BPF for 50 simulated polygenic traits, using genomic best linear unbiased prediction (GBLUP) models trained with individuals from either full-sib (FSF), half-sib (HSF), or unrelated families (URF) for various sizes (N (train)) of the training set and different heritabilities (h(2)). In addition, we modified two deterministic equations for forecasting PA to account for inbreeding and genetic variance unexplained by the training set. Averaged across traits, PA was high within FSF (0.41-0.97) with large variation only for N-train < 50 and h(2) < 0.6. For HSF and URF, PA was on average similar to 40-60% lower and varied substantially among different combinations of BPFs used for model training and prediction as well as different traits. As exemplified by HSF results, PA of across-family GP can be very low if causal variants not segregating in the training set account for a sizeable proportion of the genetic variance among predicted individuals. Deterministic equations accurately forecast the PA expected over many traits, yet cannot capture trait-specific deviations. We conclude that model training within BPFs generally yields stable PA, whereas a high level of uncertainty is encountered in across-family GP. Our study shows the extent of variation in PA that must be at least reckoned with in practice and offers a starting point for the design of training sets composed of multiple BPFs.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据