☆ 4.7 Article

Ensemble and single algorithm models to handle multicollinearity of UAV vegetation indices for predicting rice biomass

COMPUTERS AND ELECTRONICS IN AGRICULTURE (2023)

Journal

COMPUTERS AND ELECTRONICS IN AGRICULTURE

Volume 205, Issue -, Pages -

Publisher

ELSEVIER SCI LTD

DOI: 10.1016/j.compag.2023.107621

Keywords

Rice; Algorithm; Vegetation index; Unmanned aerial vehicle; Multicollinearity

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This study compares the model performance, variance, stability, and confidence of base and ensemble machine learning models in the context of multicollinearity and non-multicollinearity for predicting rice biomass. The experiment shows that ensemble machine learning models outperform base models for predicting all rice biomass traits in the multicollinearity context. Base and ensemble machine learning models exhibit inconsistent patterns of R2 and RMSE variances in both multicollinearity and non-multicollinearity contexts. Multicollinearity and the base-ensemble machine learning concept do not affect model confidence, which is subject to the cross-effects of machine learning and dataset characteristics.

Rice biomass is a biofuel's source and yield indicator. Conventional sampling methods predict rice biomass accurately. However, these methods are destructive, time-consuming, expensive, and labour-intensive. Instead, unmanned aerial vehicles (UAVs) cover such shortcomings by providing rice-attribute-sensitive vegetation indices (VIs). Nevertheless, VIs are collinear, and their analyses require machine learning algorithms (MLs). The analysis of collinear VIs using base (single) and ensemble MLs is yet to be investigated. Therefore, this study aims to compare the base and ensemble MLs' model performance, variance, stability (under/overfitting), and confidence for rice biomass prediction in multicollinearity context (MCC) and non-multicollinearity context (NMCC). To that end, a randomised complete block design experiment was held in the IADA KETARA rice granary in Terengganu, Malaysia. The experiment resulted in 360 samples of five biomass traits, five spectral bands, and ninety VIs. The MLs model performance and under/overfitting were better in MCC than in NMCC for predicting all rice biomass traits. The ensemble MLs outperformed the base MLs for predicting all rice biomass traits in MCC and NMCC. All base and ensemble MLs achieved inconsistent patterns of R2 and RMSE variances in MCC and NMCC. Finally, multicollinearity and the base-ensemble MLs concept did not affect the model confidence; rather, the latter was subject to the cross-effects of the ML and dataset characteristics. The present study significantly reveals the level of different base and ensemble MLs' sensitivity to multicollinearity regarding model performance, stability, variance, and confidence.

Ensemble and single algorithm models to handle multicollinearity of UAV vegetation indices for predicting rice biomass

Journal

COMPUTERS AND ELECTRONICS IN AGRICULTURE

Publisher

ELSEVIER SCI LTD

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Ensemble and single algorithm models to handle multicollinearity of UAV vegetation indices for predicting rice biomass

Journal

COMPUTERS AND ELECTRONICS IN AGRICULTURE

Publisher

ELSEVIER SCI LTD

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper