4.7 Article

Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction

Journal

Publisher

BMC
DOI: 10.1186/s40104-020-00515-5

Keywords

Accuracy; Drosophila melanogaster; Genomic prediction; Multi-omics data; SNP preselection

Funding

  1. National Natural Science Foundation of China [31772556]
  2. Local Innovative and Research Teams Project of Guangdong Province [2019BT02N630]
  3. earmarked fund for China Agriculture Research System [CARS-35]
  4. Science and Technology Innovation Strategy projects of Guangdong Province [2018B020203002]

Ask authors/readers for more resources

Background Presently, multi-omics data (e.g., genomics, transcriptomics, proteomics, and metabolomics) are available to improve genomic predictors. Omics data not only offers new data layers for genomic prediction but also provides a bridge between organismal phenotypes and genome variation that cannot be readily captured at the genome sequence level. Therefore, using multi-omics data to select feature markers is a feasible strategy to improve the accuracy of genomic prediction. In this study, simultaneously using whole-genome sequencing (WGS) and gene expression level data, four strategies for single-nucleotide polymorphism (SNP) preselection were investigated for genomic predictions in the Drosophila Genetic Reference Panel. Results Using genomic best linear unbiased prediction (GBLUP) with complete WGS data, the prediction accuracies were 0.208 +/- 0.020 (0.181 +/- 0.022) for the startle response and 0.272 +/- 0.017 (0.307 +/- 0.015) for starvation resistance in the female (male) lines. Compared with GBLUP using complete WGS data, both GBLUP and the genomic feature BLUP (GFBLUP) did not improve the prediction accuracy using SNPs preselected from complete WGS data based on the results of genome-wide association studies (GWASs) or transcriptome-wide association studies (TWASs). Furthermore, by using SNPs preselected from the WGS data based on the results of the expression quantitative trait locus (eQTL) mapping of all genes, only the startle response had greater accuracy than GBLUP with the complete WGS data. The best accuracy values in the female and male lines were 0.243 +/- 0.020 and 0.220 +/- 0.022, respectively. Importantly, by using SNPs preselected based on the results of the eQTL mapping of significant genes from TWAS, both GBLUP and GFBLUP resulted in great accuracy and small bias of genomic prediction. Compared with the GBLUP using complete WGS data, the best accuracy values represented increases of 60.66% and 39.09% for the starvation resistance and 27.40% and 35.36% for startle response in the female and male lines, respectively. Conclusions Overall, multi-omics data can assist genomic feature preselection and improve the performance of genomic prediction. The new knowledge gained from this study will enrich the use of multi-omics in genomic prediction.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available