4.7 Article

Integrating satellite and climate data to predict wheat yield in Australia using machine learning approaches

期刊

AGRICULTURAL AND FOREST METEOROLOGY
卷 274, 期 -, 页码 144-159

出版社

ELSEVIER
DOI: 10.1016/j.agrformet.2019.03.010

关键词

Wheat; Crop yield prediction; Enhanced vegetation index; Solar-induced fluorescence; Machine learning

资金

  1. Blue Waters Professorship
  2. NASA New Investigator Award [NNX16AI56G]
  3. University of Illinois ACES Future Interdisciplinary Research Explorations (FIRE) grant
  4. Illinois Distinguished Graduate Fellowship
  5. National Science Foundation [1443080, 0725070, 1238993]
  6. state of Illinois
  7. NASA [903006, NNX16AI56G] Funding Source: Federal RePORTER
  8. Direct For Computer & Info Scie & Enginr
  9. Office of Advanced Cyberinfrastructure (OAC) [0725070, 1443080] Funding Source: National Science Foundation
  10. Direct For Computer & Info Scie & Enginr
  11. Office of Advanced Cyberinfrastructure (OAC) [1238993] Funding Source: National Science Foundation

向作者/读者索取更多资源

Wheat is the most important staple crop grown in Australia, and Australia is one of the top wheat exporting countries globally. Timely and reliable wheat yield prediction in Australia is important for regional and global food security. Prior studies use either climate data, or satellite data, or a combination of these two to build empirical models to predict crop yield. However, though the performance of yield prediction using empirical methods is improved by combining the use of climate and satellite data, the contributions from different data sources are still not clear. In addition, how the regression-based methods compare with various machine learning based methods in their performance in yield prediction is also not well understood and needs in-depth investigation. This work integrated various sources of data to predict wheat yield across Australia from 2000 to 2014 at the statistical division (SD) level. We adopted a well-known regression method (LASSO, as a benchmark) and three mainstream machine learning methods (support vector machine, random forest, and neural network) to build various empirical models for yield prediction. For satellite data, we used the enhanced vegetation index (EVI) from MODIS and solar-induced chlorophyll fluorescence (SIF) from GOME-2 and SCIAMACHY as metrics to approximate crop productivity. The machine-learning based methods outperform the regression method in modeling crop yield. Our results confirm that combining climate and satellite data can achieve high performance of yield prediction at the SD level (R-2 similar to 0.75). The satellite data track crop growth condition and gradually capture the variability of yield evolving with the growing season, and their contributions to yield prediction usually saturate at the peak of the growing season. Climate data provide extra and unique information beyond what the satellite data have offered for yield prediction, and our empirical modeling work shows the added values of climate variables exist across the whole season, not only at some certain stages. We also find that using EVI as an input can achieve better performance in yield prediction than SIF, primarily due to the large noise in the satellite-based SIF data (i.e. coarse resolution in both space and time). In addition, we also explored the potential for timely wheat yield prediction in Australia, and we can achieve the optimal prediction performance with approximately two-month lead time before wheat maturity. The proposed methodology in this paper can be extended to different crops and different regions for crop yield prediction.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据