4.7 Article

Combination of Machine Learning and Kriging for Spatial Estimation of Geological Attributes

Journal

NATURAL RESOURCES RESEARCH
Volume 31, Issue 1, Pages 191-213

Publisher

SPRINGER
DOI: 10.1007/s11053-021-10003-w

Keywords

Spatial estimation; Kriging; Machine learning; Super learner; Combination; Sequential quadratic programming

Funding

  1. Centre for Computational Geostatistics (CCG)

Ask authors/readers for more resources

This study proposes a methodology that combines kriging and machine learning models to improve the accuracy of spatial estimation of geological features. By combining the estimation results from both methods, more accurate estimates can be obtained compared to using either machine learning models or kriging alone. The effectiveness of the proposed methodology is demonstrated through experiments.
A growing number of studies in the spatial estimation of geological features use machine learning (ML) models, as these models promise to provide efficient solutions for estimation especially in non-Gaussian, non-stationary and complex cases. However, these models have two major limitations: (1) the data are considered to be independent and identically distributed (or spatially uncorrelated), and (2) the data are not reproduced at their locations. Kriging, on the other hand, has a long history of generating unbiased estimates with minimum error variance at unsampled locations. Kriging assumes stationarity and linearity. This study proposes a methodology that combines kriging and ML models to mitigate the disadvantages of each method and obtain more accurate estimates. In the proposed methodology, a stacked ensemble model, which is also referred to as the super learner (SL) model, is applied for ML modeling. We have shown how the estimates generated by the SL model and estimates obtained from kriging can be combined through a weighting function based on a kriging variance. The weights are optimized using the sequential quadratic programming. The proposed methodology is demonstrated in two synthetic case studies containing data with non-stationarity and non-Gaussian features; a real case study using a dataset from an oilsands deposit is also presented. The performance of the combined model is compared with the SL model and kriging using the coefficient of determination (R-squared), root mean squared error, and mean absolute error criteria. The combined model appears to yield more accurate estimates than the ones generated by SL model and kriging in all cases.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available