4.7 Article

Estimation of eggplant yield with machine learning methods using spectral vegetation indices

Journal

COMPUTERS AND ELECTRONICS IN AGRICULTURE
Volume 202, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.compag.2022.107367

Keywords

Crop yield prediction; Eggplant; Machine learning; Spectral vegetation indices; Remote sensing

Funding

  1. Scientific and Techno- logical Research Council of Turkey (TUBITAK)
  2. [114O538]

Ask authors/readers for more resources

Estimating crop yields is crucial for agricultural planning, and remote sensing products like VI are commonly used for this purpose. This study aimed to predict eggplant yield using VIs and machine learning methods, with the best results achieved by the ANN model based on PCA inputs.
Estimation of crop yields included in the planning is an essential condition for accurate and timely agricultural planning. Remotely sensed products, such as the spectral vegetation index (VI), are widely used in estimation of crop yields. The integration of remotely sensed data into machine learning methods will have the potential to develop a real-time management system specific to the area of interest. The main aim of the study was to determine the eggplant yield in field conditions, based on VIs obtained from a handheld spectroradiometer, using five different machine learning methods (artificial neural networks (ANN), support vector machines (SVR), k nearest neighbor (kNN), random forests (RF), and Adaptive boosting (AB)), and compare the performances of the methods. The data used in the study were obtained in field experiments focusing on determining the most suitable irrigation program for eggplant production in a semi-humid climate region in northern Turkey during 2015, 2016 and 2017 growing seasons. Irrigation treatments consisted of a total of five applications, which were full water application (I1:100 %) and different deficit ration of full water application (I2:I1x 75 %, I3: I1x50%, I4: I1x25% and I5: rainfed based). Input variables used in yield estimation models were determined by correlation analysis and principal components analysis (PCA). The inputs in the models were different combinations of 10 different VIs, the number of days after planting (DAP) and water application coefficients. In addition, an alternative approach was proposed, in which PCA components were used as input for yield estimation. All machine learning models using PCA-based inputs were estimated with higher accuracy than other input combinations. The best results were obtained with the ANN model based on PCA-based inputs; therefore, this model was chosen for eggplant yield estimation (coefficient of determination (R-2) = 0.973, mean absolute error (MAE) = 274.816 kg ha(-1), root mean square error (RMSE) = 352.787 kg ha(-1) and Nash-Sutcliffe efficiency (NSE) = 0.951). The lowest accuracy for yield estimation was recorded in RF model. The prediction accuracy of the models using a single VI as input was low. Green index (GI) and green vegetation index (GVI) had the highest impact on eggplant yield, and eggplant yield was estimated with higher accuracy with these indices, which are sensitive to chlorophyll absorption. The findings of the current study demonstrate the benefits of using remotely sensed data and PCA together in machine learning models to more reliably and accurately estimate eggplant yield at regional scale.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available