4.5 Article

Evaluation of machine learning algorithms in the prediction of hydraulic conductivity and soil moisture at the Brazilian Savannah

Journal

GEODERMA REGIONAL
Volume 30, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.geodrs.2022.e00569

Keywords

Pedotransfer functions; Irrigation; Hydrology; Oxisols

Categories

Funding

  1. Brazilian Agricultural Research Corporation
  2. Federal University of Viosa (UFV)
  3. Coordination for the Improvement of Higher Education Personnel (CAPES) [001]

Ask authors/readers for more resources

The Brazilian Savannah is the main agricultural region in Brazil, but the intensification of agriculture has led to an increase in water disputes. The lack of soil hydraulic data in the region creates uncertainty in water resources management. Machine learning algorithms, particularly random forest and support vector regression models, showed good performance in predicting soil hydraulic properties and soil moisture.
The Brazilian Savannah (Cerrado biome) is the the main agricultural region in Brazil. The Cerrado has experi-enced a growing intensification of agriculture and an increase in disputes over water use, highlighting the need to establish strategies to reduce the water withdrawn from waterbodies, mainly through irrigation. The lack of data at a proper scale on soil hydraulic properties in the region brings uncertainties to the process of water resources management. Obtaining these data, however, is difficult and costly, thus, opening the opportunity for the use of Pedotransfer Functions (PTFs). Various methods can be used to obtain PTFs, but currently, machine learning techniques are gaining strength. In this context, it becomes important to evaluate the quality of machine learning algorithms in predicting PTFs. The present work aimed to evaluate the performance of machine learning algorithms in the prediction of saturated soil hydraulic conductivity (Ks) and soil moisture at tensions of 0, 6, 10, 33, 100, and 1500 kPa for the Cerrado Biome. Four machine learning algorithms were tested: Multiple Adaptive Regression Splines (MARS), Random Forest (RF), Support Vector Regression (SVR), and K Nearest Neighbors (KNN). Four combinations of soil data were evaluated and the predictor variables used in each set were different. In set A1, the following variables were used: sand (Sa), silt (Si), and clay (Cl) contents; in set A2: Sa, Si, Cl, and bulk density (BD); in set A3: Sa, Si, Cl, BD, particle density (Dp), total porosity (Pt), microporosity (Micro), and macroporosity (Macro); and in set A4: Sa, Si, Cl, BD, Dp, Pt, Micro, Macro, soil moisture at field capacity (theta(10)), and soil moisture at permanent wilting point (theta(1500)). The set A4 along with the RF and SVR models had the best performances in the prediction of Ks. As for soil moisture, the RF, SVR, and MARS models showed the best performances with low RMSE and ME values, and R-2 above 0.8 using predictor sets A3 and A4.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available