4.7 Article

Machine learning models outperform deep learning models, provide interpretation and facilitate feature selection for soybean trait prediction

期刊

BMC PLANT BIOLOGY
卷 22, 期 1, 页码 -

出版社

BMC
DOI: 10.1186/s12870-022-03559-z

关键词

Machine learning; XGBoost; Interpretable models; Feature selection; Genomic selection; Soybean

资金

  1. Australian Government
  2. Government of Western Australia
  3. Australia Research Council [DP1601004497, LP160100030, DP200100762, DE210100398]
  4. Grains Research and Development Corporation [9177539, 9177591]
  5. Forrest Research Foundation
  6. Australian Research Council [DE210100398] Funding Source: Australian Research Council

向作者/读者索取更多资源

Recent growth in crop genomic and trait data has provided opportunities for faster crop improvement. Research shows that machine learning outperforms deep learning in genotype to phenotype prediction, and identified loci overlap with GWAS and previous studies. Feature importance rankings can reduce marker input and maintain or improve prediction performance.
Recent growth in crop genomic and trait data have opened opportunities for the application of novel approaches to accelerate crop improvement. Machine learning and deep learning are at the forefront of prediction-based data analysis. However, few approaches for genotype to phenotype prediction compare machine learning with deep learning and further interpret the models that support the predictions. This study uses genome wide molecular markers and traits across 1110 soybean individuals to develop accurate prediction models. For 13/14 sets of predictions, XGBoost or random forest outperformed deep learning models in prediction performance. Top ranked SNPs by F-score were identified from XGBoost, and with further investigation found overlap with significantly associated loci identified from GWAS and previous literature. Feature importance rankings were used to reduce marker input by up to 90%, and subsequent models maintained or improved their prediction performance. These findings support interpretable machine learning as an approach for genomic based prediction of traits in soybean and other crops.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据