4.7 Article

Machine learning algorithms for rapid estimation of holocellulose content of poplar clones based on Raman spectroscopy

期刊

CARBOHYDRATE POLYMERS
卷 292, 期 -, 页码 -

出版社

ELSEVIER SCI LTD
DOI: 10.1016/j.carbpol.2022.119635

关键词

Holocellulose content; Machine learning algorithms; Raman spectroscopy; CatBoost; XGBoost

资金

  1. National Key Research and Development Program [2017YFD0600201]
  2. National Natural Science Foundation of China [31770596]
  3. State Key Laboratory of Bio-Fibers and Eco-Textiles [K2019-13]

向作者/读者索取更多资源

In this study, regularization, classical machine learning, and advanced gradient boosting algorithms were used to build predictive models of poplar holocellulose content based on Raman spectra features. The evaluation results showed that classical machine learning algorithms outperformed regularization algorithms, and the advanced gradient boosting algorithms performed better than classical machine learning algorithms. Models built with CatBoost and XGBoost achieved high predictive accuracy for holocellulose content estimation. Therefore, Raman spectroscopy coupled with machine learning algorithms is a promising tool for predicting holocellulose content in poplar.
In this study, regularization algorithms (RR, LR, and ENR), classical ML algorithms (SVR, DT, and RF), and advanced GBM algorithms (LightGBM, CatBoost, and XGBoost) were applied to build the holocellulose content predictive models of poplar based on features extracted from Raman spectra. Evaluation results of models indicate that classical ML algorithms show higher predictive accuracy than regularization algorithms, and the advanced GBM algorithms better than the classical ML algorithms. Furthermore, models built by CatBoost and XGBoost can estimate holocellulose content with high predictive accuracy of test R(2 )above 0.93 and test RMSE less than 0.29%. It provides the heretofore best precision of holocellulose content predictive model based on Raman spectroscopy so far for our knowledge. Therefore, it is suggested that Raman spectroscopy coupled with ML algorithms is a promising tool for predicting the holocellulose content in poplar which can be applied in large-scale tree genetic and breeding programs.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据