4.6 Article

Prediction of municipality-level winter wheat yield based on meteorological data using machine learning in Hokkaido, Japan

Journal

PLOS ONE
Volume 16, Issue 10, Pages -

Publisher

PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pone.0258677

Keywords

-

Ask authors/readers for more resources

This study developed a machine learning model to predict winter wheat yield in Hokkaido, Japan, utilizing high-resolution meteorological data and yield statistics. The model showed that precipitation, temperature, and irradiance during grain-filling period were significant factors affecting yield under wet climate conditions, with PLS, SVR, and RF outperforming traditional linear regression models. The analysis also highlighted the importance of including key meteorological variables in predicting yield accurately, demonstrating the usefulness of explainable machine learning in crop yield prediction.
This study analyzed meteorological constraints on winter wheat yield in the northern Japanese island, Hokkaido, and developed a machine learning model to predict municipality-level yields from meteorological data. Compared to most wheat producing areas, this island is characterized by wet climate owing to greater annual precipitation and abundant snowmelt water supply in spring. Based on yield statistics collected from 119 municipalities for 14 years (N = 1,516) and high-resolution surface meteorological data, correlation analyses showed that precipitation, daily minimum air temperature, and irradiance during the grain-filling period had significant effects on the yield throughout the island while the effect of snow depth in early winter and spring was dependent on sites. Using 10-d mean meteorological data within a certain period between seeding and harvest as predictor variables and one-year-leave-out cross-validation procedure, performance of machine learning models based on neural network (NN), random forest (RF), support vector machine regression (SVR), partial least squares regression (PLS), and cubist regression (CB) were compared to a multiple linear regression model (MLR) and a null model that returns an average yield of the municipality. The root mean square errors of PLS, SVR, and RF were 872, 982, and 1,024 kg ha(-1) and were smaller than those of MLR (1,068 kg ha(-1)) and null model (1,035 kg ha(-1)). These models outperformed the controls in other metrics including Pearson's correlation coefficient and Nash-Sutcliffe efficiency. Variable importance analysis on PLS indicated that minimum air temperature and precipitation during the grain-filling period had major roles in the prediction and excluding predictors in this period (i.e. yield forecast with a longer lead-time) decreased forecast performance of the models. These results were consistent with our understanding of meteorological impacts on wheat yield, suggesting usefulness of explainable machine learning in meteorological crop yield prediction under wet climate.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available