☆ 4.7 Article

Exploring Deep Learning for Complex Trait Genomic Prediction in Polyploid Outcrossing Species

FRONTIERS IN PLANT SCIENCE (2020)

期刊

FRONTIERS IN PLANT SCIENCE

卷 11, 期 -, 页码 -

出版社

FRONTIERS MEDIA SA

DOI: 10.3389/fpls.2020.00025

关键词

genomic prediction; genomic selection; polyploid species; deep learning; epistasis; complex traits; strawberry; blueberry

类别

Plant Sciences

资金

Ministry of Economy and Science (MINECO, Spain)
MINECO [AGL2016-78709-R]
EU (MINECO/AEI/FEDER, EU) [BFU2016-77236-P]
Centro de Excelencia Severo Ochoa 2016-2019 award [SEV-2015-0533]
US Department of Agriculture/National Institute of Food and Agriculture Specialty Crop Research Initiative (SCRI) project 'RosBREED: Combining disease resistance with horticultural quality in new rosaceous cultivars' [2014-51181-22378]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Genomic prediction (GP) is the procedure whereby the genetic merits of untested candidates are predicted using genome wide marker information. Although numerous examples of GP exist in plants and animals, applications to polyploid organisms are still scarce, partly due to limited genome resources and the complexity of this system. Deep learning (DL) techniques comprise a heterogeneous collection of machine learning algorithms that have excelled at many prediction tasks. A potential advantage of DL for GP over standard linear model methods is that DL can potentially take into account all genetic interactions, including dominance and epistasis, which are expected to be of special relevance in most polyploids. In this study, we evaluated the predictive accuracy of linear and DL techniques in two important small fruits or berries: strawberry and blueberry. The two datasets contained a total of 1,358 allopolyploid strawberry (2n=8x=112) and 1,802 autopolyploid blueberry (2n=4x=48) individuals, genotyped for 9,908 and 73,045 single nucleotide polymorphism (SNP) markers, respectively, and phenotyped for five agronomic traits each. DL depends on numerous parameters that influence performance and optimizing hyperparameter values can be a critical step. Here we show that interactions between hyperparameter combinations should be expected and that the number of convolutional filters and regularization in the first layers can have an important effect on model performance. In terms of genomic prediction, we did not find an advantage of DL over linear model methods, except when the epistasis component was important. Linear Bayesian models were better than convolutional neural networks for the full additive architecture, whereas the opposite was observed under strong epistasis. However, by using a parameterization capable of taking into account these non-linear effects, Bayesian linear models can match or exceed the predictive accuracy of DL. A semiautomatic implementation of the DL pipeline is available at https://github.com/lauzingaretti/deepGP/..

Exploring Deep Learning for Complex Trait Genomic Prediction in Polyploid Outcrossing Species

期刊

FRONTIERS IN PLANT SCIENCE

出版社

FRONTIERS MEDIA SA

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Exploring Deep Learning for Complex Trait Genomic Prediction in Polyploid Outcrossing Species

期刊

FRONTIERS IN PLANT SCIENCE

出版社

FRONTIERS MEDIA SA

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文