4.7 Article

Exploring quantitative structure-property relationship models for environmental fate assessment of petroleum hydrocarbons

Journal

ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH
Volume 30, Issue 10, Pages 26218-26233

Publisher

SPRINGER HEIDELBERG
DOI: 10.1007/s11356-022-23904-x

Keywords

Petroleum hydrocarbons; Quantitative structure-property relationship; OECD; Biodegradation half-life; Double cross-validation; Best subset selection; Partial least squares

Ask authors/readers for more resources

This study develops predictive QSPR models for forecasting the biodegradation half-life of untested petroleum hydrocarbons within the established models' applicability domain. The models use easily computable 2D descriptors to investigate important structural characteristics for biodegradation in different aquatic environments. The models are robust, predictive, and mechanistically interpretable, and the prediction quality is improved using the intelligent consensus predictor tool.
The rate and extent of biodegradation of petroleum hydrocarbons in the different aquatic environments is an important element to address. The major avenue for removing petroleum hydrocarbons from the environment is thought to be biodegradation. The present study involves the development of predictive quantitative structure-property relationship (QSPR) models for the primary biodegradation half-life of petroleum hydrocarbons that may be used to forecast the biodegradation half-life of untested petroleum hydrocarbons within the established models' applicability domain. These models use easily computable two-dimensional (2D) descriptors to investigate important structural characteristics needed for the biodegradation of petroleum hydrocarbons in freshwater (dataset 1), temperate seawater (dataset 2), and arctic seawater (dataset 3). All the developed models follow OECD guidelines. We have used double cross-validation, best subset selection, and partial least squares tools for model development. In addition, the small dataset modeler tool has been successfully used for the dataset with very few compounds (dataset 3 with 17 compounds), where dataset division was not possible. The resultant models are robust, predictive, and mechanistically interpretable based on both internal and external validation metrics (R-2 range of 0.605-0.959. Q((Loo))(2) range of 0.509-0.904, and Q(2) (F1) range of 0.526-0.959). The intelligent consensus predictor tool has been used for the improvement of the prediction quality for test set compounds which provided superior outcomes to those from individual partial least squares models based on several metrics (Q(F1)(2) = 0.808 and Q(F2)(2) = 0.805 for dataset 1 in freshwater). Molecular size and hydrophilic factor for freshwater, frequency of two carbon atoms at topological distance 4 for temperate seawater, and electronegative atom count relative to size for arctic seawater were found to be the most significant descriptors responsible for the regulation of biodegradation half-life of petroleum hydrocarbons.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available