Article

CRISPRDetect: A flexible algorithm to define CRISPR arrays

Journal

BMC GENOMICS
Volume 17, Issue -, Pages -

Publisher

BMC
DOI: 10.1186/s12864-016-2627-0

Keywords

Phage resistance; Plasmids; Horizontal gene transfer; Cas; CRISPR; Small RNA targets; crRNA; Bioinformatics; Repeat elements

Funding

  1. Marsden Fund
  2. Rutherford Discovery Fellowship from the Royal Society of NZ
  3. Human Frontier Science Program
  4. University of Otago Postgraduate Scholarship
  5. University of Otago Postgraduate Publishing Bursary
  6. University of Otago's Division of Health Sciences Career Development postdoctoral fellowship

Background: CRISPR (clustered regularly interspaced short palindromic repeats) RNAs provide the specificity for noncoding RNA-guided adaptive immune defence systems in prokaryotes. CRISPR arrays consist of repeat sequences separated by specific spacer sequences. CRISPR arrays have previously been identified in a large proportion of prokaryotic genomes. However, currently available detection algorithms do not utilise recently discovered features of CRISPR loci.

Results: We have developed a new approach to automatically detect, predict and interactively refine CRISPR arrays. It is available as a web application and as a command-line tool from bioanalysis.otago.ac.nz/CRISPRDetect. CRISPRDetect discovers putative arrays, extends each array by detecting additional variant repeats, corrects the direction of arrays, refines the repeat/spacer boundaries, and annotates different types of sequence variation (e.g. insertions/deletions) in near-identical repeats. Due to these features, CRISPRDetect has significant advantages when compared to existing identification tools. As well as providing further support for small, medium and large repeats, CRISPRDetect identified a class of arrays with 'extra-large' repeats in bacteria (repeats of 44-50 nt). The CRISPRDetect output is integrated with other analysis tools. Notably, the predicted spacers can be directly utilised by CRISPRTarget to predict targets.

Conclusion: CRISPRDetect enables more accurate detection of arrays and spacers, and its GFF output is suitable for inclusion in genome annotation pipelines and for visualisation. It has been used to analyse all complete bacterial and archaeal reference genomes.
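To make the repeat/spacer structure described in the abstract concrete, the following is a minimal Python sketch of how a toy array could be located in a sequence. It is not the CRISPRDetect algorithm: it assumes exact repeat copies only, whereas CRISPRDetect also handles variant repeats, array direction and boundary refinement. The function names (`find_toy_array`, `extract_spacers`) and all parameter defaults are hypothetical and chosen only for illustration.

```python
def find_toy_array(seq, repeat_len=8, min_spacer=4, max_spacer=12, min_repeats=3):
    """Return (repeat, positions) for the first exact repeat that recurs
    with spacer lengths in [min_spacer, max_spacer], or None if none exists.

    Illustrative assumption: repeats are identical copies of fixed length.
    """
    for start in range(len(seq) - repeat_len + 1):
        repeat = seq[start:start + repeat_len]
        positions = [start]
        cursor = start + repeat_len  # first base after the current repeat copy
        while True:
            # The next copy must begin after a spacer of allowed length,
            # and must fit entirely inside the search window.
            nxt = seq.find(repeat, cursor + min_spacer,
                           cursor + max_spacer + repeat_len)
            if nxt == -1:
                break
            positions.append(nxt)
            cursor = nxt + repeat_len
        if len(positions) >= min_repeats:
            return repeat, positions
    return None


def extract_spacers(seq, positions, repeat_len):
    """Return the sequences lying between consecutive repeat copies."""
    return [seq[p + repeat_len:q] for p, q in zip(positions, positions[1:])]


if __name__ == "__main__":
    # Synthetic example: flank + (repeat, spacer, repeat, spacer, repeat) + flank
    repeat_unit = "GATCCGTA"
    toy = "TTTT" + repeat_unit + "ACGTAC" + repeat_unit + "TTGACA" + repeat_unit + "AAAA"
    hit = find_toy_array(toy)
    if hit is not None:
        repeat, positions = hit
        print(repeat, positions, extract_spacers(toy, positions, len(repeat)))
```

Exact matching keeps the sketch short, but it would miss the near-identical repeats with insertions or deletions that the abstract says CRISPRDetect annotates; a realistic detector would need approximate matching at that step.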
