4.7 Article

Crop yield prediction via explainable AI and interpretable machine learning: Dangers of black box models for evaluating climate change impacts on crop yield

Journal

AGRICULTURAL AND FOREST METEOROLOGY
Volume 336, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.agrformet.2023.109458

Keywords

Statistical crop modeling; Interpretable machine learning; Explainable AI; Crop yield; Nonlinear climate effects; Technology trend

Ask authors/readers for more resources

Statistical crop modeling is crucial for understanding the impact of climate on crop yields. The choice of models is important, as linear regression is interpretable but lacks predictive power, while machine learning is highly predictive but often lacks interpretability. In this study, a Bayesian ensemble model (BM) was developed to explore historical crop yield data and predict future yields, providing both interpretability and high predictive power. BM incorporates many models via Bayesian model averaging, fits complex functions, and quantifies model uncertainty.
Statistical crop modeling is pivotal for understanding climate impacts on crop yields. Choices of models matter: Linear regression is interpretable but limited in predictive power; machine learning predicts well but often re-mains a black box. To develop explainable artificial intelligence (AI) for exploring historical crop yield data and predicting crop yield, here we reported a Bayesian ensemble model (BM) that is interpretable with great explanatory and predictive power. BM embraces many competitive models via Bayesian model averaging, fits complex functions, and quantifies model uncertainty. Long-term crop yields are driven by both climate and technology; the common practice of first detrending and then analyzing the detrended data has an incorrigible bias. Therefore, BM was also aimed at decomposing historical yield data to jointly estimate technological trends and climate effects on crop yield. We compared BM with ElasticNet, Neural Network, MARS, SVM, Random Forests, and XGBoost. BM excelled at both predicting and explaining. When tested on synthetic data, BM was the only method unveiling the true relationships: BM has stronger interpretability; other methods predicted well but for wrong reasons. When tested on maize yield data in Ohio, BM detected two technological shifts, attributable to hybrid corn adoption in the 1940 ' s and the technological slowing-down in the 1970 ' s: No other methods detected such changepoints. BM derived nonlinear asymmetric crop responses to climate and non-negligible temperature -precipitation interacting effects, with patterns consistent with theoretical or experimental evidence. Extrapola-tion of all the models for future yield prediction was highly uncertain, but BM provided more reliable predictions under novel climate whereas Random Forests and XGBoost proved unsuitable for extrapolation. Overall, BM provided new insights unattainable by the existing black-box methods. We caution against blind use of black-box machine learning for statistical crop modeling and call for more efforts to apply interpretable machine learning for mechanistic understandings of crop-climate interactions.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available