4.6 Article

Prediction of Polycyclic Aromatic Hydrocarbons (PAHs) Removal from Wastewater Treatment Sludge Using Machine Learning Methods

Journal

WATER AIR AND SOIL POLLUTION
Volume 232, Issue 3, Pages -

Publisher

SPRINGER INT PUBL AG
DOI: 10.1007/s11270-021-05049-8

Keywords

PAH; Wastewater treatment sludge; UV-C light; Data mining; machine learning; Over-sampling methods; Prediction of PAH removal efficiency

Funding

  1. Commission of Scientific Research Projects of Bursa Uludag University [UAP (M) 2009/20]

Ask authors/readers for more resources

This study focused on predicting the removal efficiency of PAHs from wastewater treatment sludges using machine learning methods. Various classification and machine learning methods were proposed, with RF and k-NN showing the best performance with high prediction accuracies. RF outperformed other methods in predicting removal efficiencies on multi-class imbalanced datasets, providing cost-effective and efficient prediction results.
Removal of polycyclic aromatic hydrocarbons (PAHs) from wastewater treatment sludge with appropriate technologies is of great importance for nature and public health. UV technology is one of the most frequently used methods for the removal of PAHs. While various photodegradation applications with UV-C (ultraviolet-C) light and photocatalysts can be performed to remove these compounds, a large number of tests should be implemented to determine optimum removal conditions, which increase time and cost. It is possible to make predictions for the removal efficiency of PAHs by using data mining classification and reveal the hidden knowledge from data. This study aims to determine appropriate machine learning (ML) methods for the prediction of the PAH removal efficiency from wastewater treatment sludges regarding the initial PAH levels. The samples have multi-class imbalanced outputs; thus, random over-sampling and Synthetic Minority Over-sampling TEchniques (SMOTE) are used to improve the prediction results. Well-known data mining classification/machine learning methods, artificial neural network (multi-layer perceptron-MLP), k-means (k-NN), support vector machine (SVM), decision tree (C4.5), random forest (RF), and Bagging, are proposed for the prediction of removal efficiencies. Different evaluation metrics, Accuracy, multi-class AUC (MAUC-multi-class area under ROC curve), F-measure, Precision, Recall, and Specificity are used for the performance comparisons. RF and k-NN perform better with 92.35% and 92.36% average prediction accuracies, respectively. Besides, RF outperforms other methods with 0.97 MAUC value. RF and k-NN can be used for the removal efficiency prediction on the multi-class imbalanced datasets successfully, and removal efficiencies can be highly predicted considering input components with less cost and effort.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available