Article; Data Paper

A curated dataset of complete Enterobacteriaceae plasmids compiled from the NCBI nucleotide database

Journal

DATA IN BRIEF
Volume 12, Pages 423-426

Publisher

ELSEVIER SCIENCE BV
DOI: 10.1016/j.dib.2017.04.024

Keywords

Plasmids; Sequence data curation; Complete genomes; Enterobacteriaceae family

Funding

  1. National Institute for Health Research Health Protection Research Unit (NIHR HPRU) in Healthcare Associated Infections and Antimicrobial Resistance at Oxford University
  2. Public Health England (PHE) [HPRU-2012-10041]
  3. NIHR/University of Oxford Academic Clinical Lectureship

Thousands of plasmid sequences are now publicly available in the NCBI nucleotide database, but they are not reliably annotated to distinguish complete plasmids from plasmid fragments, such as gene or contig sequences; retrieving complete plasmids for downstream analyses is therefore challenging. Here we present a curated dataset of complete bacterial plasmids from the clinically relevant Enterobacteriaceae family. The dataset was compiled from the NCBI nucleotide database using curation steps designed to exclude incomplete plasmid sequences and chromosomal sequences misannotated as plasmids. Over 2000 complete plasmid sequences are included in the curated plasmid dataset. Protein sequences produced by translating each complete plasmid nucleotide sequence in all six reading frames are also provided. Further analysis and discussion of the dataset are presented in an accompanying research article: "Ordering the mob: insights into replicon and MOB typing..." (Orlek et al., 2017) [1]. The curated plasmid sequences are publicly available in the Figshare repository. © 2017 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license.
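The six-frame translation mentioned in the abstract can be illustrated with a minimal Python sketch. The abstract does not say which tool or genetic code the authors used, so everything here is an assumption made for illustration: the function names (`reverse_complement`, `translate`, `six_frame_translate`) are hypothetical, the standard genetic code is assumed, ambiguous codons are written as `X`, and the sequence is treated as linear, so codons spanning the origin of a circular plasmid are not translated.

```python
# Illustrative sketch only; not the authors' pipeline.
# Assumes the standard genetic code and a linear, uppercase DNA sequence.

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1; trailing partial codons are dropped."""
    seq = seq.upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks codons with non-ACGT bases
        for i in range(0, len(seq) - 2, 3)
    )


def six_frame_translate(seq: str) -> dict:
    """Translate both strands in all three offsets, keyed '+1'..'+3' and '-1'..'-3'."""
    frames = {}
    for sign, strand in (("+", seq.upper()), ("-", reverse_complement(seq))):
        for offset in range(3):
            frames[f"{sign}{offset + 1}"] = translate(strand[offset:])
    return frames


if __name__ == "__main__":
    for name, protein in six_frame_translate("ATGGCCTAA").items():
        print(name, protein)
```

Stop codons are kept as `*` rather than used to split the output into open reading frames; whether the published protein files do the same is not stated in the abstract.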
