4.8 Article

Machine learning prediction of lignin content in poplar with Raman spectroscopy

Journal

BIORESOURCE TECHNOLOGY
Volume 348, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.biortech.2022.126812

Keywords

Raman spectroscopy; Lignin content; Machine learning; XGBoost; Gradient boosting machine

Funding

  1. National Key Research and Development Program [2017YFD0600201]
  2. National Natural Science Foundation of China [31770596]
  3. State Key Laboratory of BioFibers and Eco-Textiles [K2019-13]

Ask authors/readers for more resources

Based on extracted features from Raman spectra, prediction models for lignin content in poplar were developed using regularization algorithms (SVR, DT, RF, LightGBM, CatBoost, and XGBoost). The results showed that RF, LightGBM, CatBoost, and XGBoost outperformed the other algorithms, with test R-2 values >0.91, indicating close prediction to measured values.
Based on features extracted from Raman spectra, regularization algorithms, SVR, DT, RF, LightGBM, CatBoost, and XGBoost were used to develop prediction models for lignin content in poplar. Firstly, Raman features extracted from FT-Raman spectra after data processing were used as input of models and determined lignin contents were output. Secondly, grid-search combined with cross-validation was used to adjust the hyperparameters of models. Finally, the predictive models were built by aforementioned algorithms. The results indicated regularization algorithms, SVR, DT held test R-2 were >0.80 which means the predictive values from model still deviate from measured ones. Meanwhile, RF, LightGBM, CatBoost, and XGBoost were better than above algorithms, and their test R-2 were >0.91 which suggesting the predictive values was nearly close to measured ones. Therefore, fast and accurate methods for predicting lignin content were obtained and will be useful for screening suitable lignocellulosic resource with expected lignin content.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available