4.8 Article

How data analysis affects power, reproducibility and biological insight of RNA-seq studies in complex datasets

期刊

NUCLEIC ACIDS RESEARCH
卷 43, 期 16, 页码 7664-7674

出版社

OXFORD UNIV PRESS
DOI: 10.1093/nar/gkv736

关键词

-

资金

  1. NRSA [T32NS007413, T32HL007953]
  2. Brush Family Professorship
  3. DARPA [58077 LSDRP]
  4. NIH [R01MH087463]
  5. [DA036984]
  6. [R01MH101491]

向作者/读者索取更多资源

The sequencing of the full transcriptome (RNA-seq) has become the preferred choice for the measurement of genome-wide gene expression. Despite its widespread use, challenges remain in RNA-seq data analysis. One often-overlooked aspect is normalization. Despite the fact that a variety of factors or 'batch effects' can contribute unwanted variation to the data, commonly used RNA-seq normalization methods only correct for sequencing depth. The study of gene expression is particularly problematic when it is influenced simultaneously by a variety of biological factors in addition to the one of interest. Using examples from experimental neuroscience, we show that batch effects can dominate the signal of interest; and that the choice of normalization method affects the power and reproducibility of the results. While commonly used global normalization methods are not able to adequately normalize the data, more recently developed RNA-seq normalization can. We focus on one particular method, RUVSeq and show that it is able to increase power and biological insight of the results. Finally, we provide a tutorial outlining the implementation of RUVSeq normalization that is applicable to a broad range of studies as well as meta-analysis of publicly available data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据