4.7 Article

Prediction of undrained shear strength using extreme gradient boosting and random forest based on Bayesian optimization

Journal

GEOSCIENCE FRONTIERS
Volume 12, Issue 1, Pages 469-477

Publisher

CHINA UNIV GEOSCIENCES, BEIJING
DOI: 10.1016/j.gsf.2020.03.007

Keywords

Undrained shear strength; Extreme gradient boosting; Random forest; Bayesian optimization; k-fold CV

Funding

  1. High-end Foreign Expert Introduction program [G20190022002]
  2. Chongqing Construction Science and Technology Plan Project [2019-0045]
  3. Chongqing Engineering Research Center of Disaster Prevention & Control for Banks and Structures in Three Gorges Reservoir Area [SXAPGC18ZD01, SXAPGC18YB03]

Ask authors/readers for more resources

This study applies XGBoost and RF methods to predict the USS of soft clays, showing that these approaches outperform traditional methods. Bayesian optimization is used to determine model hyperparameters, leading to more accurate and robust models.
Accurate assessment of undrained shear strength (USS) for soft sensitive clays is a great concern in geotechnical engineering practice. This study applies novel data-driven extreme gradient boosting (XGBoost) and random forest (RF) ensemble learning methods for capturing the relationships between the USS and various basic soil parameters. Based on the soil data sets from TC304 database, a general approach is developed to predict the USS of soft clays using the two machine learning methods above, where five feature variables including the preconsolidation stress (PS), vertical effective stress (VES), liquid limit (LL), plastic limit (PL) and natural water content (W) are adopted. To reduce the dependence on the rule of thumb and inefficient brute-force search, the Bayesian optimization method is applied to determine the appropriate model hyper-parameters of both XGBoost and RF. The developed models are comprehensively compared with three comparison machine learning methods and two transformation models with respect to predictive accuracy and robustness under 5-fold cross-validation (CV). It is shown that XGBoost-based and RF-based methods outperform these approaches. Besides, the XGBoost-based model provides feature importance ranks, which makes it a promising tool in the prediction of geotechnical parameters and enhances the interpretability of model.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available