4.7 Article

A comparative study of RNA-seq analysis strategies

期刊

BRIEFINGS IN BIOINFORMATICS
卷 16, 期 6, 页码 932-940

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbv007

关键词

RNA-seq; transcriptome assembly; gene expression; RNA splicing

资金

  1. Wellcome Trust [WT097679]
  2. Cambridge Biomedical Research Centre
  3. Cancer Research UK [C14303/A10825]
  4. Medical Research Council [G1002319]
  5. Biotechnology and Biological Sciences Research Council [BB/G000352/1] Funding Source: researchfish
  6. Medical Research Council [MC_UP_0801/1, G1002319] Funding Source: researchfish
  7. BBSRC [BB/G000352/1] Funding Source: UKRI
  8. MRC [MC_UP_0801/1, G1002319] Funding Source: UKRI

向作者/读者索取更多资源

Three principal approaches have been proposed for inferring the set of transcripts expressed in RNA samples using RNA-seq. The simplest approach uses curated annotations, which assumes the transcripts in a sample are a subset of the transcripts listed in a curated database. A more ambitious method involves aligning reads to a reference genome and using the alignments to infer the transcript structures, possibly with the aid of a curated transcript database. The most challenging approach is to assemble reads into putative transcripts de novo without the aid of reference data. We have systematically assessed the properties of these three approaches through a simulation study. We have found that the sensitivity of computational transcript set estimation is severely limited. Computational approaches (both genome-guided and de novo assembly) produce a large number of artefacts, which are assigned large expression estimates and absorb a substantial proportion of the signal when performing expression analysis. The approach using curated annotations shows good expression correlation even when the annotations are incomplete. Furthermore, any incorrect transcripts present in a curated set do not absorb much signal, so it is preferable to have a curation set with high sensitivity than high precision. Software to simulate transcript sets, expression values and sequence reads under a wider range of parameter values and to compare sensitivity, precision and signal-to-noise ratios of different methods is freely available online (https://github.com/boboppie/RSSS) and can be expanded by interested parties to include methods other than the exemplars presented in this article.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据