4.7 Article

Ensemble and single algorithm models to handle multicollinearity of UAV vegetation indices for predicting rice biomass

Journal

COMPUTERS AND ELECTRONICS IN AGRICULTURE
Volume 205, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.compag.2023.107621

Keywords

Rice; Algorithm; Vegetation index; Unmanned aerial vehicle; Multicollinearity

Ask authors/readers for more resources

This study compares the model performance, variance, stability, and confidence of base and ensemble machine learning models in the context of multicollinearity and non-multicollinearity for predicting rice biomass. The experiment shows that ensemble machine learning models outperform base models for predicting all rice biomass traits in the multicollinearity context. Base and ensemble machine learning models exhibit inconsistent patterns of R2 and RMSE variances in both multicollinearity and non-multicollinearity contexts. Multicollinearity and the base-ensemble machine learning concept do not affect model confidence, which is subject to the cross-effects of machine learning and dataset characteristics.
Rice biomass is a biofuel's source and yield indicator. Conventional sampling methods predict rice biomass accurately. However, these methods are destructive, time-consuming, expensive, and labour-intensive. Instead, unmanned aerial vehicles (UAVs) cover such shortcomings by providing rice-attribute-sensitive vegetation indices (VIs). Nevertheless, VIs are collinear, and their analyses require machine learning algorithms (MLs). The analysis of collinear VIs using base (single) and ensemble MLs is yet to be investigated. Therefore, this study aims to compare the base and ensemble MLs' model performance, variance, stability (under/overfitting), and confidence for rice biomass prediction in multicollinearity context (MCC) and non-multicollinearity context (NMCC). To that end, a randomised complete block design experiment was held in the IADA KETARA rice granary in Terengganu, Malaysia. The experiment resulted in 360 samples of five biomass traits, five spectral bands, and ninety VIs. The MLs model performance and under/overfitting were better in MCC than in NMCC for predicting all rice biomass traits. The ensemble MLs outperformed the base MLs for predicting all rice biomass traits in MCC and NMCC. All base and ensemble MLs achieved inconsistent patterns of R2 and RMSE variances in MCC and NMCC. Finally, multicollinearity and the base-ensemble MLs concept did not affect the model confidence; rather, the latter was subject to the cross-effects of the ML and dataset characteristics. The present study significantly reveals the level of different base and ensemble MLs' sensitivity to multicollinearity regarding model performance, stability, variance, and confidence.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available