4.7 Article

Discovering weather periods and crop properties favorable for coffee rust incidence from feature selection approaches

Journal

COMPUTERS AND ELECTRONICS IN AGRICULTURE
Volume 176, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.compag.2020.105640

Keywords

Coffea arabica; Hemileia vastatrix; Crop disease; Dimensionality reduction; Machine learning; Model explanation

Funding

  1. Telematics Engineering Group (GIT) of the University of Cauca
  2. Tropical Agricultural Research and Higher Education Center (CATIE)
  3. InnovAccion Cauca project of the Colombian Science, Technology and Innovation Fund (SGR-CTI)
  4. Red de formacion de talento humano para la innovacion social y productiva en el departamento del Cauca InnovAccion Cauca [5271]

Ask authors/readers for more resources

Coffee Leaf Rust (CLR) is a disease that leads to considerable losses in the worldwide coffee industry; as those that have been reported recently in Colombia and Central America. The early detection of favorable conditions for epidemics could be used to improve decision making for the coffee grower and thus reduce the losses due to the disease. Researchers tried to predict the occurrence of the disease earlier through statistical and machine learning models from crop properties, disease indicators and weather conditions. These studies considered the impact of weather variables in a common period for all. Assuming that the dynamics of weather that most impact the development of the disease occur in the same time periods is simplistic. We propose an approach to discover the time period (window) for each weather variables and crop related features that most explain a future ob-served CLR incidence, in order to obtain a prediction model through machine learning. The selection of the variables more related with coffee rust incidence and rejection of the features with no significant contribution of information in machine learning tasks were approached from Feature Selection methods (Filter, Wrapper, Embedded). In this way, a CLR incidence prediction model based on the features with the greatest impact on the development of the disease was obtained. Moreover, the use of SHapley Additive exPlanations allowed us to identify the impact of features in the model prediction. The monitoring of coffee rust incidence is the most important predictor, since it provides information about current inoculum and this determines how much can the incidence grow or decrease. Temperature is a determining driver for germination and penetration phases in days 9 to 6 and 4 to 1 before the date of prediction. Additionally, the amount of rain determines whether uredospore dispersal or washing conditions occurred. The mean absolute error expected in the model is 6.94% of incidence, trained with XGBoost algorithm and the dataset reduced by Embedded method. The estimation of the disease incidence 28 days later can be used to improve decision making in control and nutrition practices.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available