4.7 Article

Supervised machine learning for source allocation of per- and polyfluoroalkyl substances (PFAS) in environmental samples

Journal

CHEMOSPHERE
Volume 252, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.chemosphere.2020.126593

Keywords

PFAS; Source allocation; Machine learning; Neural networks; Pattern recognition

Funding

  1. U.S. Department of Defense, through the Strategic Environmental Research and Development Program (SERDP) [ER20-1205]
  2. Humphrey's Engineer Centre Support Activity [W912HQ-20-P-0005]

Ask authors/readers for more resources

Environmental contamination by per- and polyfluoroalkyl substances (PFAS) is widespread, because of both their decades of use, and their persistence in the environment. These factors can make identification of the source of contamination in samples a challenge, because in many cases contamination may originate from decades ago, or from a number of candidate sources. Forensic source allocation is important for delineating plumes, and may also be able to provide insights into environmental behaviors of specific PFAS components. This paper describes work conducted to explore the use of supervised machine learning classifiers for allocating the source of PFAS contamination based on patterns identified in component concentrations. A dataset containing PFAS component concentrations in 1197 environmental water samples was assembled based on data from sites from around the world. The dataset was split evenly into training and test datasets, and the 598-sample training dataset was used to train four machine learning classifiers, including three conventional machine learning classifiers (Extra Trees, Support-Vector Machines, K-Neighbors), and one multilayer perceptron feedforward deep neural network. Of the methods tested, the deep neural network and Extra Trees exhibited particularly high performance at classification of samples from a range of sources. The fact that the methods function on completely different principles and yet provide similar predictions supports the hypothesis that patterns exist in PFAS water sample data that can allow forensic source allocation. The results of the work support the idea that supervised machine learning may have substantial promise as a tool for forensic source allocation. (C) 2020 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available