4.7 Article

Clay content mapping and uncertainty estimation using weighted model averaging

Journal

CATENA
Volume 209, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.catena.2021.105791

Keywords

Clay; Electromagnetic induction; gamma-ray spectrometry; Machine learning; Ensemble models; Digital soil mapping

Funding

  1. Australian Government Research Training Program Scholarship
  2. Australian Federal Government Cotton Research and Development Corporation
  3. Australian Federal Government Australian Cotton Cooperative Research Centre
  4. Australian Federal Government Natural Heritage Trust Program

Ask authors/readers for more resources

This study compared the importance of proximally and remotely sensed data for predicting clay at district scale, using different models and averaging techniques. The results showed that gamma-ray data was crucial for topsoil clay prediction, while slope was important for subsoil. Random forest model was found to be the best for predicting topsoil clay, with Granger-Ramanathan averaging recommended as a protocol for district-scale clay prediction.
Accurate prediction of clay is the basis for soil quality assessment and decision making in land use because it governs soil moisture and fertility dynamics. However, using laboratory methods to determine clay across a large district and at multiple depths is tedious and expensive. An alternative is to use proximally and remotely sensed digital data, that can be coupled to laboratory measured clay through models. This study aims to predict topsoil (0-0.3 m) and subsoil (0.9-1.2 m) clay at district scale by comparing; i) importance of proximally (i.e., apparent soil electrical conductivity - ECa) and remotely (i.e., gamma-ray spectrometry, digital elevation model - DEM) sensed data, ii) models including a linear mixed model (LMM) and machine learning models (MLs, i.e., Cubist, random forest [RF], support vector machine regression [SVMR], quantile regression forests [QRF], extreme gradient boosting [XGBoost] and bagEarth), iii) two model averaging techniques (i.e., Granger-Ramanathan averaging (GRA) and Lin's concordance (LCCC) weights) from the top four best models, and iv) uncertainty of the prediction. The results showed that the gamma-ray data was most important for topsoil clay prediction, while in the subsoil the slope was most important. Moreover, for topsoil clay prediction the RF was best with fair accuracy (RPD = 1.64), followed by QRF (1.62), Cubist (1.61) and LMM (1.55) which outperformed bagEarth (1.51), SVMR (1.47) and XGBoost (1.47). For the subsoil, all seven models achieved poor accuracy (RPD < 1.4) with RF (1.34) again being the best. The 90% prediction interval was larger in the subsoil compared to topsoil. Furthermore, while both averaging methods improved the prediction in both depths, the improvement using GRA was more pronounced. Therefore, we recommend the GRA be adopted as a protocol for district-scale clay prediction.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available