4.6 Review

Comparative evaluation of full-length isoform quantification from RNA-Seq

期刊

BMC BIOINFORMATICS
卷 22, 期 1, 页码 -

出版社

BMC
DOI: 10.1186/s12859-021-04198-1

关键词

Benchmarking; Isoform quantification; Simulated data; Pseudo-alignment; RNA-seq; Short reads

资金

  1. National Center for Advancing Translational Sciences Grant [5UL1TR000003]
  2. National Institute of Mental Health (NIMH) T32 Training grant [5T32MH106442-04]

向作者/读者索取更多资源

The accuracy of full-length isoform quantification is influenced by factors such as length, sequence compression complexity, and incomplete annotation. Salmon, kallisto, RSEM, and Cufflinks perform well on idealized data, but not significantly better than simpler approaches on realistic data. Therefore, full-length isoform quantification and isoform-level differential expression analysis should be selectively employed.
Background Full-length isoform quantification from RNA-Seq is a key goal in transcriptomics analyses and has been an area of active development since the beginning. The fundamental difficulty stems from the fact that RNA transcripts are long, while RNA-Seq reads are short. Results Here we use simulated benchmarking data that reflects many properties of real data, including polymorphisms, intron signal and non-uniform coverage, allowing for systematic comparative analyses of isoform quantification accuracy and its impact on differential expression analysis. Genome, transcriptome and pseudo alignment-based methods are included; and a simple approach is included as a baseline control. Conclusions Salmon, kallisto, RSEM, and Cufflinks exhibit the highest accuracy on idealized data, while on more realistic data they do not perform dramatically better than the simple approach. We determine the structural parameters with the greatest impact on quantification accuracy to be length and sequence compression complexity and not so much the number of isoforms. The effect of incomplete annotation on performance is also investigated. Overall, the tested methods show sufficient divergence from the truth to suggest that full-length isoform quantification and isoform level DE should still be employed selectively.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据