4.8 Article

Prediction and analysis of net ecosystem carbon exchange based on gradient boosting regression and random forest

Journal

APPLIED ENERGY
Volume 262, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.apenergy.2020.114566

Keywords

Net ecosystem carbon exchange; Variable importance analysis; Gradient boosting regression; Random forest; Prediction model

Funding

  1. National Natural Science Foundation of China [41722403]
  2. Natural Science Foundation of Hubei Province [2018CFB259, 2018CFA051]
  3. Fundamental Research Funds for the Central Universities [2462019YJRC011]
  4. CDIAC Ecosystem Thematic Center
  5. ICOS Ecosystem Thematic Center
  6. OzFlux office
  7. ChinaFlux office
  8. AsiaFlux office

Ask authors/readers for more resources

Carbon balance is essential to keep ecosystems sustainable and healthy. Net ecosystem carbon exchange (NEE), which is affected by a bunch of meteorological variables to different extent, helps to gauge the balance of the carbon cycle between biological organisms and atmosphere. In this study, the NEE data is collected from two flux measuring sites. Gradient boosting regression algorithm is employed to predict NEE based on the meteorology and flux data from site UK-Gri. During the training process, KFold cross-validation algorithm is implemented to avoid overfitting, and random forest algorithm is implemented to identify the important variables influencing NEE mostly. The four most important variables are found to be global radiation, photosynthetic active radiation, minimum soil temperature, and latent heat. The regression model was compared with three state-of-the-art prediction models: support vector machine, stochastic gradient descent, and bayesian ridge to verify its performance. The experimental results show that this regression model outperforms the other three models, and gives higher value of R-squared, lower values of mean absolute error and root mean squared error. To verify the regression model's generalization ability, the data from the second flux site, NL-Loo, was employed, and the hybrid data of the two sites was used. The results show that this model performs well on the hybrid data, too. In practical terms, the gradient boosting regression model provides many tunable hypterparameters and loss functions, which make it more flexible and accurate compared to the other three models. This study has conclusively demonstrated for the first time that the combination of gradient boosting regression and random forest models should be considered as valuable tools to make effective prediction for NEE and acquire reliable important variables influencing NEE mostly. The methodologies could be useful in the research fields of ecosystem stability evaluation, environmental restoration, trend analysis of climate change, and global warming monitoring.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available