3.9 Article

Dimensionality Reduction Statistical Models for Soil Attribute Prediction Based on Raw Spectral Data

Journal

AI
Volume 3, Issue 4, Pages 809-819

Publisher

MDPI
DOI: 10.3390/ai3040049

Keywords

multivariate analysis; visible near-infrared; X-ray fluorescence

Funding

  1. CNPq [458180/2014-9]
  2. Sao Paulo Research Foundation (FAPESP) [2020/16670-9]

Ask authors/readers for more resources

This study evaluated the predictive performance of two dimensionality reduction statistical models (PCR and lasso) for modeling soil spectral data without pretreatment techniques. The results showed that PCR and lasso achieved good performance in predicting soil attributes using raw spectral data. The comparison with literature results that employed pretreatment techniques indicated similar performance. However, there was no consensus on the best calibration approach, which seemed to be attribute and area specific.
To obtain a better performance when modeling soil spectral data for attribute prediction, researchers frequently resort to data pretreatment, aiming to reduce noise and highlight the spectral features. Even with the awareness of the existence of dimensionality reduction statistical approaches that can cope with data sparse dimensionality, few studies have explored its applicability in soil sensing. Therefore, this study's objective was to assess the predictive performance of two dimensionality reduction statistical models that are not widespread in the proximal soil sensing community: principal components regression (PCR) and least absolute shrinkage and selection operator (lasso). Here, these two approaches were compared with multiple linear regressions (MLR). All of the modelling strategies were applied without employing pretreatment techniques for soil attribute determination using X-ray fluorescence spectroscopy (XRF) and visible and near-infrared diffuse reflectance spectroscopy (Vis-NIR) data. In addition, the achieved results were compared against the ones reported in the literature that applied pretreatment techniques. The study was carried out with 102 soil samples from two distinct fields. Predictive models were developed for nine chemical and physical soil attributes, using lasso, PCR and MLR. Both Vis-NIR and XRF raw spectral data presented a great performance for soil attribute prediction when modelled with PCR and the lasso method. In general, similar results were found comparing the root mean squared error (RMSE) and coefficient of determination (R-2) from the literature that applied pretreatment techniques and this study. For example, considering base saturation (V%), for Vis-NIR combined with PCR, in this study, RMSE and R-2 values of 10.60 and 0.79 were found compared with 10.38 and 0.80, respectively, in the literature. In addition, looking at potassium (K), XRF associated with lasso yielded an RMSE value of 0.60 and R-2 of 0.92, and in the literature, RMSE and R-2 of 0.53 and 0.95, respectively, were found. The major discrepancy was observed for phosphorus (P) and organic matter (OM) prediction applying PCR in the XRF data, which showed R-2 of 0.33 (for P) and 0.52 (for OM) without using pretreatment techniques in this study, and R-2 of 0.01 (for P) and 0.74 (for OM) when using preprocessing techniques in the literature. These results indicate that data pretreatment can be disposable for predicting some soil attributes when using Vis-NIR and XRF raw data modeled with dimensionality reduction statistical models. Despite this, there is no consensus on the best way to calibrate data, as this seems to be attribute and area specific.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

3.9
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available