4.6 Article

Longer is Not Always Better: Optimizing Barcode Length for Large-Scale Species Discovery and Identification

Journal

SYSTEMATIC BIOLOGY
Volume 69, Issue 5, Pages 999-1015

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/sysbio/syaa014

Keywords

DNA barcoding; metabarcoding; mini-barcodes; species discovery

Funding

  1. Ministry of Education grant on biodiversity discovery [R-154-000-A22-112]

New techniques for the species-level sorting of millions of specimens are needed in order to accelerate species discovery, determine how many species live on earth, and develop efficient biomonitoring techniques. These sorting methods should be reliable, scalable, and cost-effective, as well as being largely insensitive to low-quality genomic DNA, given that this is usually all that can be obtained from museum specimens. Mini-barcodes seem to satisfy these criteria, but it is unclear how well they perform for species-level sorting when compared with full-length barcodes. This is here tested based on 20 empirical data sets covering ca. 30,000 specimens (5500 species) and six clade-specific data sets from GenBank covering ca. 98,000 specimens (>20,000 species). All specimens in these data sets had full-length barcodes and had been sorted to species-level based on morphology. Mini-barcodes of different lengths and positions were obtained in silico from full-length barcodes using a sliding window approach (three windows: 100 bp, 200 bp, and 300 bp) and by excising nine mini-barcodes with established primers (length: 94-407 bp). We then tested whether barcode length and/or position reduces species-level congruence between morphospecies and molecular operational taxonomic units (mOTUs) that were obtained using three different species delimitation techniques (Poisson Tree Process, Automatic Barcode Gap Discovery, and Objective Clustering). Surprisingly, we find no significant differences in performance for both species- or specimen-level identification between full-length and mini-barcodes as long as they are of moderate length (>200 bp). Only very short mini-barcodes (<200 bp) perform poorly, especially when they are located near the 5' end of the Folmer region. The mean congruence between morphospecies and mOTUs was ca. 75% for barcodes >200 bp and the congruent mOTUs contain ca. 75% of all specimens. Most conflict is caused by ca. 10% of the specimens that can be identified and should be targeted for reexamination in order to efficiently resolve conflict. Our study suggests that large-scale species discovery, identification, and metabarcoding can utilize mini-barcodes without any demonstrable loss of information compared to full-length barcodes.
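
The two operations the abstract describes, cutting mini-barcodes out of full-length barcodes with a sliding window and scoring congruence between morphospecies and mOTUs, can be illustrated with a minimal Python sketch. It is not the authors' pipeline. The function names, the `step` parameter, the toy sequence, and the exact-match definition of congruence (a morphospecies counts as congruent only when its specimen set is identical to one mOTU) are all assumptions made for illustration; the abstract specifies only the window lengths (100, 200, and 300 bp).

```python
from collections import defaultdict


def sliding_windows(seq, window, step=1):
    """Yield (start, fragment) for every window of length `window`.

    `step` is a hypothetical parameter; the abstract does not state
    how far the window was shifted between positions.
    """
    for start in range(0, len(seq) - window + 1, step):
        yield start, seq[start:start + window]


def congruence(morpho, motu):
    """Fraction of morphospecies whose specimens form exactly one mOTU.

    `morpho` maps specimen id -> morphospecies label.
    `motu` maps specimen id -> cluster label from any delimitation method.
    Exact set identity is an assumed definition of congruence.
    """
    by_morpho = defaultdict(set)
    by_motu = defaultdict(set)
    for specimen, species in morpho.items():
        by_morpho[species].add(specimen)
    for specimen, cluster in motu.items():
        by_motu[cluster].add(specimen)
    motu_sets = {frozenset(members) for members in by_motu.values()}
    matches = sum(1 for members in by_morpho.values()
                  if frozenset(members) in motu_sets)
    return matches / len(by_morpho)


# Toy data (not from the study): a 20-base sequence and four specimens.
toy_barcode = "ACGT" * 5
fragments = list(sliding_windows(toy_barcode, window=8, step=4))

morpho = {"s1": "A", "s2": "A", "s3": "B", "s4": "C"}
motu = {"s1": 1, "s2": 1, "s3": 2, "s4": 2}
score = congruence(morpho, motu)
```

In the toy data, morphospecies A matches cluster 1 exactly, while B and C are lumped into cluster 2, so only one of three morphospecies is congruent. Repeating the clustering on each window's fragments and comparing scores across window lengths and positions would mirror the comparison the abstract reports.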
