4.7 Article

Supervised machine learning is superior to indicator value inference in monitoring the environmental impacts of salmon aquaculture using eDNA metabarcodes

Journal

MOLECULAR ECOLOGY
Volume 30, Issue 13, Pages 2988-3006

Publisher

WILEY
DOI: 10.1111/mec.15434

Keywords

bacteria; ciliates; eDNA; environmental monitoring; random forest; salmon farming

Funding

  1. Norwegian Seafood Research Fund [901092]
  2. German Research Foundation (DFG) [STO414/15-1]
  3. Swiss National Science Foundation [31003A_179125]

Ask authors/readers for more resources

The study compared the performance of two different methods (supervised machine learning and indicator value calculation) in inferring ecological quality using DNA metabarcodes as biological indicators, and recommended using bacterial DNA metabarcodes in combination with supervised machine learning for classification.
Increasing anthropogenic impact and global change effects on natural ecosystems has prompted the development of less expensive and more efficient bioassessments methodologies. One promising approach is the integration of DNA metabarcoding in environmental monitoring. A critical step in this process is the inference of ecological quality (EQ) status from identified molecular bioindicator signatures that mirror environmental classification based on standard macroinvertebrate surveys. The most promising approaches to infer EQ from biotic indices (BI) are supervised machine learning (SML) and the calculation of indicator values (IndVal). In this study we compared the performance of both approaches using DNA metabarcodes of bacteria and ciliates as bioindicators obtained from 152 samples collected from seven Norwegian salmon farms. Results from standard macroinvertebrate-monitoring of the same samples were used as reference to compare the accuracy of both approaches. First, SML outperformed the IndVal approach to infer EQ from eDNA metabarcodes. The Random Forest (RF) algorithm appeared to be less sensitive to noisy data (a typical feature of massive environmental sequence data sets) and uneven data coverage across EQ classes (a typical feature of environmental compliance monitoring scheme) compared to a widely used method to infer IndVals for the calculation of a BI. Second, bacteria allowed for a more accurate EQ assessment than ciliate eDNA metabarcodes. For the implementation of DNA metabarcoding into routine monitoring programmes to assess EQ around salmon aquaculture cages, we therefore recommend bacterial DNA metabarcodes in combination with SML to classify EQ categories based on molecular signatures.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available