4.6 Article

Introducing an rbcL and a trnL reference library to aid in the metabarcoding analysis of foraged plants from two semi-arid eastern South African savanna bioregions

期刊

PLOS ONE
卷 18, 期 5, 页码 -

出版社

PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pone.0286144

关键词

-

向作者/读者索取更多资源

The purpose of this study was to build DNA barcode sequence reference databases for plant species that are potential herbivore foraging targets in semi-arid savannas of eastern South Africa. The researchers collected and mined species sequences from GenBank and BOLD databases to ensure accurate taxonomic coverage and resolution. The final rbcL and trnL datasets contained sequences representing multiple genera and species, which should be used together to identify plants in the semi-arid eastern savannas of South Africa.
Success of a metabarcoding study is determined by the extent of taxonomic coverage and the quality of records available in the DNA barcode reference database used. This study aimed to create an rbcL and a trnL (UAA) DNA barcode sequence reference database of plant species that are potential herbivore foraging targets and commonly found in semi-arid savannas of eastern South Africa. An area-specific species list of 765 species was compiled according to plant collection records available and areas comparable to an eastern semi-arid South African savanna. Thereafter, rbcL and trnL sequences of species from this list were mined from GenBank and BOLD sequence databases according to specific quality criteria to ensure accurate taxonomic coverage and resolution. These were supplemented with sequences of 24 species sequenced for this study. A phylogenetic approach, employing Neighbor-Joining, was used to verify the topology of the reference libraries to known angiosperm phylogeny. The taxonomic reliability of these reference libraries was evaluated by testing for the presence of a barcode gap, identifying a data-appropriate identification threshold, and determining the identification accuracy of reference sequences via primary distance-based criteria. The final rbcL reference dataset consisted of 1238 sequences representing 318 genera and 562 species. The final trnL dataset consisted of 921 sequences representing 270 genera and 461 species. Barcode gaps were found for 76% of the taxa in the rbcL barcode reference dataset and 68% of the taxa in the trnL barcode reference dataset. The identification success rate, calculated with the k-nn criterion was 85.86% for the rbcL dataset and 73.72% for the trnL dataset. The datasets for rbcL and trnL combined during this study are not presented as complete DNA reference libraries, but rather as two datasets that should be used in unison to identify plants present in the semi-arid eastern savannas of South Africa.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据