4.7 Article

Source allocation of per- and polyfluoroalkyl substances (PFAS) with supervised machine learning: Classification performance and the role of feature selection in an expanded dataset

期刊

CHEMOSPHERE
卷 275, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.chemosphere.2021.130124

关键词

PFAS Source allocation; Machine learning; Neural networks; Pattern recognition; Classification

资金

  1. U.S. Department of Defense, through the Strategic Environmental Research and Development Program (SERDP) [ER20-1205]

向作者/读者索取更多资源

This study explores the use of supervised machine learning to identify the source of PFAS in water samples, focusing on distinguishing between PFAS from AFFF fire suppression foam and other sources. The results show that although PFAS composition can vary significantly at a site, machine learning can be used to recognize compositional patterns in the environment for source allocation.
This work explores the use of supervised machine learning as a tool for identifying the source of per- and polyfluorinated alkyl substances (PFAS) in water samples on the basis of the detected component concentrations. Specifically, the work focuses on distinguishing between PFAS used in aqueous film forming foam (AFFF) fire suppression applications, and PFAS from other sources. The fact that many sites contaminated with legacy PFOS-based AFFF formulations are dominated by perfluorinated sulfonates can make it tempting to naively classify samples dominated by perfluorinated sulfonates as being of AFFF origin. However, a large fraction of samples do not follow this pattern, including some of the most important cases, such as legacy PFOS-based AFFF far from its source. Although PFAS composition can vary substantially at a site as a result of mobility differences between components and other factors, the hypothesis driving the work is that compositional patterns created in the environment can be recognized across different sites by machine learning, and used for source allocation. This work builds on earlier preliminary work by the authors based on a small dataset. This work is based on a much larger 8040-sample dataset, and explores different preprocessing approaches, as well as how feature selection impacts classification performance. The results of this work strongly support the idea that supervised machine learning based on composition can identify patterns that can be used to distinguish PFAS sources. The results provide new insights into selection of classifiers and features for source identification based on PFAS sample composition. (C) 2021 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据