4.6 Article

Multiclassification Prediction of Clay Sensitivity Using Extreme Gradient Boosting Based on Imbalanced Dataset

期刊

APPLIED SCIENCES-BASEL
卷 12, 期 3, 页码 -

出版社

MDPI
DOI: 10.3390/app12031143

关键词

clay sensitivity; imbalanced categories; SMOTE; XGBoost

资金

  1. National Natural Science Foundation of China [41790432, 51908093]
  2. National Key Research and Development Program of China [2021YFB2600026]

向作者/读者索取更多资源

This study investigates the performance of extreme gradient boosting (XGBoost) in predicting multiclass clay sensitivity and the ability of synthetic minority over-sampling technique (SMOTE) in addressing imbalanced categories. The results show that XGBoost performs the best in the prediction of clay sensitivity, while SMOTE is useful in addressing imbalanced issues.
Predicting clay sensitivity is important to geotechnical engineering design related to clay. Classification charts and field tests have been used to predict clay sensitivity. However, the imbalanced distribution of clay sensitivity is often neglected, and the predictive performance could be more accurate. The purpose of this study was to investigate the performance that extreme gradient boosting (XGboost) method had in predicting multiclass of clay sensitivity, and the ability that synthetic minority over-sampling technique (SMOTE) had in addressing imbalanced categories of clay sensitivity. Six clay parameters were used as the input parameters of XGBoost, and SMOTE was used to deal with imbalanced classes. Then, the dataset was divided using the cross-validation (CV) method. Finally, XGBoost, artificial neural network (ANN), and Naive Bayes (NB) were used to classify clay sensitivity. The F1 score, receiver operating characteristic (ROC), and area under the ROC curve (AUC) were considered as the performance indicators. The results revealed that XGBoost showed the best performance in the multiclassification prediction of clay sensitivity. The F1 score and mean AUC of XGBoost were 0.72 and 0.89, respectively. SMOTE was useful in addressing imbalanced issues, and XGBoost was an effective and reliable method of classifying clay sensitivity.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据