期刊
BIOINFORMATICS
卷 35, 期 18, 页码 3372-3377出版社
OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btz089
关键词
-
类别
资金
- Biotechnology and Biological Sciences Research Council [BB/H002286/1, BB/J00247X/1, BB/M010066/1, BB/M004155/1]
- BBSRC [BB/J00247X/1, BB/M010066/1, BB/M004155/1, BB/H002286/1] Funding Source: UKRI
Motivation: RNA-seq experiments are usually carried out in three or fewer replicates. In order to work well with so few samples, differential gene expression (DGE) tools typically assume the form of the underlying gene expression distribution. In this paper, the statistical properties of gene expression from RNA-seq are investigated in the complex eukaryote, Arabidopsis thaliana, extending and generalizing the results of previous work in the simple eukaryote Saccharomyces cerevisiae. Results: We show that, consistent with the results in S.cerevisiae, more gene expression measurements in A.thaliana are consistent with being drawn from an underlying negative binomial distribution than either a log-normal distribution or a normal distribution, and that the size and complexity of the A.thaliana transcriptome does not influence the false positive rate performance of nine widely used DGE tools tested here. We therefore recommend the use of DGE tools that are based on the negative binomial distribution.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据