4.7 Article

Suspect screening of large numbers of emerging contaminants in environmental waters using artificial neural networks for chromatographic retention time prediction and high resolution mass spectrometry data analysis

Journal

SCIENCE OF THE TOTAL ENVIRONMENT
Volume 538, Issue -, Pages 934-941

Publisher

ELSEVIER SCIENCE BV
DOI: 10.1016/j.scitotenv.2015.08.078

Keywords

Retention time prediction; Artificial neural networks; Time-of-flight high resolution mass spectrometry; Screening of emerging contaminants

Funding

  1. EU-International Training Network SEWPROF (Marie Curie - PEOPLE Grant) [317205]
  2. Spanish Ministry of Economy and Competitiveness [CTQ2012-36189]
  3. Generalitat Valenciana, Spain [research group of excellence PROMETEO II/2014/023, ISIC 2012/016]
  4. Biotechnology and Biological Sciences Research Council (BBSRC)
  5. AstraZeneca (under the Global SHE research program) [BB/K501177/1]
  6. Biotechnology and Biological Sciences Research Council [1732933] Funding Source: researchfish

Ask authors/readers for more resources

The recent development of broad-scope high resolution mass spectrometry (HRMS) screening methods has resulted in a much improved capability for new compound identification in environmental samples. However, positive identifications at the ng/L concentration level rely on analytical reference standards for chromatographic retention time (t(R)) and mass spectral comparisons. Chromatographic t(R) prediction can play a role in increasing confidence in suspect screening efforts for new compounds in the environment, especially when standards are not available, but reliable methods are lacking. The current work focuses on the development of artificial neural networks (ANNs) for t(R) prediction in gradient reversed-phase liquid chromatography and applied along with HRMS data to suspect screening of wastewater and environmental surface water samples. Based on a compound t(R) dataset of >500 compounds, an optimized 4-layer back-propagation multi-layer perceptron model enabled predictions for 85% of all compounds to within 2 min of their measured t(R) for training (n = 344) and verification (n = 100) datasets. To evaluate the ANN ability for generalization to new data, the model was further tested using 100 randomly selected compounds and revealed 95% prediction accuracy within the 2-minute elution interval. Given the increasing concern on the presence of drug metabolites and other transformation products (TPs) in the aquatic environment, the model was applied along with HRMS data for preliminary identification of pharmaceutically-related compounds in real samples. Examples of compounds where reference standards were subsequently acquired and later confirmed are also presented. To our knowledge, this work presents for the first time, the successful application of an accurate retention time predictor and HEMS data-mining using the largest number of compounds to preliminarily identify new or emerging contaminants in wastewater and surface waters. (C) 2015 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available