4.7 Article

Comparison of bagging, boosting and stacking algorithms for surface soil moisture mapping using optical-thermal-microwave remote sensing synergies

Journal

CATENA
Volume 217, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.catena.2022.106485

Keywords

Machine learning; Stacking; Soil moisture; Sentinel-1; Landsat-8

Funding

  1. National Fellow project of the Indian Council of Agricultural Research (ICAR) , New Delhi
  2. ICAR [PGSII/78-02/PDF/2020-21/102]
  3. Head and Fellow Scientists in the Division of Agricultural Physics, IARI

Ask authors/readers for more resources

Soil moisture information is crucial for water resource management, drought monitoring, and agricultural yield prediction. This study improved the prediction accuracy of soil moisture by fusing optical, thermal, and microwave remote sensing data using machine learning techniques, and stacking multiple machine learning models was recommended for digital soil moisture mapping.
Soil moisture information is key to irrigation water management, drought monitoring, and yield prediction. It plays a vital role in the water cycle and energy budget between the earth's surface and atmosphere. Hence, its monitoring is crucial for both natural and anthropogenic environments. While the current remote sensing-based global SM products available at coarser resolution (3/15 km) are unsuitable for field-level operations, the most widely used microwave remote sensing suffers from model complexities and in-situ data requirements. Weather conditions limit the alternate approaches such as optical/thermal. This study aims to map surface soil moisture (SSM) at 30 m spatial resolution in a semi-arid region by fusing optical, thermal, and microwave remote sensing data using bagging, boosting, and stacking machine learning approaches. The reference data were collected using a soil moisture meter. The covariates included radar backscatter from Sentinel-1, visible, near-infrared, shortwave infrared, land surface temperature, and spectral indices derived from Landsat 8. Boruta algorithm was used for feature selection which identified radar backscatter, modified normalized difference water index, and land surface temperature as the most critical covariates impacting the SSM. The random forest (RF) showed the highest correlation coefficient (r = 0.71), and least root mean square error (RMSE = 5.17%). The cubist model had the least mean bias error (MBE = 0.21%) during independent validation. Stacking of cubist, gradient boosting machine (GBM), and RF using elastic net (ELNET) as meta-learner further reduced the MBE (0.18%) and RMSE (5.03%) during the validation. Overall, stacking multiple machine learning models improved model prediction and can be recommended to improve the digital soil moisture mapping.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available