4.6 Article

SAUTE: sequence assembly using target enrichment

期刊

BMC BIOINFORMATICS
卷 22, 期 1, 页码 -

出版社

BMC
DOI: 10.1186/s12859-021-04174-9

关键词

Illumina reads; De-novo assembly; de Bruijn graphs; Antimicrobial resistance; RNA-seq

资金

  1. National Institutes of Health (NIH)
  2. Intramural Research Program of the National Institutes of Health, National Library of Medicine

向作者/读者索取更多资源

SAUTE and SAUTE_PROT are assemblers proposed to assist with the assembly of repeat regions and to report multiple well supported variants when target sequences are provided. These assemblers utilize de Bruijn graphs on reads, with targets ranging from RNA-seq to genomic reads.
BackgroundIllumina is the dominant sequencing technology at this time. Short length, short insert size, some systematic biases, and low-level carryover contamination in Illumina reads continue to make assembly of repeated regions a challenging problem. Some applications also require finding multiple well supported variants for assembled regions.ResultsTo facilitate assembly of repeat regions and to report multiple well supported variants when a user can provide target sequences to assist the assembly, we propose SAUTE and SAUTE_PROT assemblers. Both assemblers use de Bruijn graph on reads. Targets can be transcripts or proteins for RNA-seq reads and transcripts, proteins, or genomic regions for genomic reads. Target sequences are nucleotide and protein sequences for SAUTE and SAUTE_PROT, respectively.ConclusionsFor RNA-seq, comparisons with Trinity, rnaSPAdes, SPAligner, and SPAdes assembly of reads aligned to target proteins by DIAMOND show that SAUTE_PROT finds more coding sequences that translate to benchmark proteins. Using AMRFinderPlus calls, we find SAUTE has higher sensitivity and precision than SPAdes, plasmidSPAdes, SPAligner, and SPAdes assembly of reads aligned to target regions by HISAT2. It also has better sensitivity than SKESA but worse precision.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据