4.6 Article

Prediction of Polycyclic Aromatic Hydrocarbons (PAHs) Removal from Wastewater Treatment Sludge Using Machine Learning Methods

期刊

WATER AIR AND SOIL POLLUTION
卷 232, 期 3, 页码 -

出版社

SPRINGER INT PUBL AG
DOI: 10.1007/s11270-021-05049-8

关键词

PAH; Wastewater treatment sludge; UV-C light; Data mining; machine learning; Over-sampling methods; Prediction of PAH removal efficiency

资金

  1. Commission of Scientific Research Projects of Bursa Uludag University [UAP (M) 2009/20]

向作者/读者索取更多资源

This study focused on predicting the removal efficiency of PAHs from wastewater treatment sludges using machine learning methods. Various classification and machine learning methods were proposed, with RF and k-NN showing the best performance with high prediction accuracies. RF outperformed other methods in predicting removal efficiencies on multi-class imbalanced datasets, providing cost-effective and efficient prediction results.
Removal of polycyclic aromatic hydrocarbons (PAHs) from wastewater treatment sludge with appropriate technologies is of great importance for nature and public health. UV technology is one of the most frequently used methods for the removal of PAHs. While various photodegradation applications with UV-C (ultraviolet-C) light and photocatalysts can be performed to remove these compounds, a large number of tests should be implemented to determine optimum removal conditions, which increase time and cost. It is possible to make predictions for the removal efficiency of PAHs by using data mining classification and reveal the hidden knowledge from data. This study aims to determine appropriate machine learning (ML) methods for the prediction of the PAH removal efficiency from wastewater treatment sludges regarding the initial PAH levels. The samples have multi-class imbalanced outputs; thus, random over-sampling and Synthetic Minority Over-sampling TEchniques (SMOTE) are used to improve the prediction results. Well-known data mining classification/machine learning methods, artificial neural network (multi-layer perceptron-MLP), k-means (k-NN), support vector machine (SVM), decision tree (C4.5), random forest (RF), and Bagging, are proposed for the prediction of removal efficiencies. Different evaluation metrics, Accuracy, multi-class AUC (MAUC-multi-class area under ROC curve), F-measure, Precision, Recall, and Specificity are used for the performance comparisons. RF and k-NN perform better with 92.35% and 92.36% average prediction accuracies, respectively. Besides, RF outperforms other methods with 0.97 MAUC value. RF and k-NN can be used for the removal efficiency prediction on the multi-class imbalanced datasets successfully, and removal efficiencies can be highly predicted considering input components with less cost and effort.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据