Article

Single-gene negative binomial regression models for RNA-Seq data with higher-order asymptotic inference

Journal

STATISTICS AND ITS INTERFACE
Volume 8, Issue 4, Pages 405-418

Publisher

INT PRESS BOSTON, INC
DOI: 10.4310/SII.2015.v8.n4.a1

Keywords

RNA-Seq; Higher-order asymptotics; Negative binomial; Regression; Overdispersion; Extra-Poisson variation; Power-robustness

Funding

  1. NIH/NIGMS Award [R01GM104977]

We consider negative binomial (NB) regression models for RNA-Seq read counts and investigate an approach in which such models are fitted to each gene separately; in particular, the NB dispersion parameter is estimated from each gene separately, without assuming commonalities between genes. This single-gene approach contrasts with the more widely used dispersion-modeling approach, in which the NB dispersion is modeled as a simple function of the mean or other measures of read abundance and then estimated from a large number of genes combined. We show that, through the use of higher-order asymptotic techniques, inferences with correct type I errors can be made about the regression coefficients in a single-gene NB regression model even when the dispersion is unknown and the sample size is small. The motivations for studying single-gene models include: 1) they provide a basis of reference for understanding and quantifying the power-robustness trade-offs of the dispersion-modeling approach; and 2) they may also be useful in practice if moderate sample sizes become available and diagnostic tools indicate potential problems with simple models of dispersion.
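To make the single-gene idea concrete, the following is a minimal sketch of fitting an NB regression to one gene by maximum likelihood, with the dispersion estimated from that gene alone. It rests on several assumptions that the abstract does not state: a log link, the common parameterization Var(Y) = mu + phi * mu^2, and an optional log library-size `offset`. The function names `nb_negloglik`, `fit_nb`, and `lrt_single_gene` are hypothetical. The test shown is an ordinary first-order likelihood ratio test, used here only as a stand-in; it is not the higher-order asymptotic adjustment the paper develops, and it can be inaccurate at small sample sizes.

```python
import numpy as np
from scipy.optimize import minimize
from scipy.special import gammaln
from scipy.stats import chi2


def nb_negloglik(params, y, X, offset):
    """Negative NB log-likelihood for one gene; params = (beta..., log phi)."""
    beta, log_phi = params[:-1], params[-1]
    eta = X @ beta + offset          # log of the mean, log(mu)
    log_r = -log_phi                 # r = 1/phi is the NB "size" parameter
    r = np.exp(log_r)
    lse = np.logaddexp(log_r, eta)   # log(r + mu), computed without overflow
    ll = (gammaln(y + r) - gammaln(r) - gammaln(y + 1.0)
          + r * (log_r - lse) + y * (eta - lse))
    return -np.sum(ll)


def fit_nb(y, X, offset):
    """ML fit for one gene; returns (beta_hat, phi_hat, maximized log-likelihood)."""
    n_coef = X.shape[1]
    start = np.zeros(n_coef + 1)
    # Assumption: column 0 of X is the intercept.
    start[0] = np.log(np.mean(y) + 0.5) - np.mean(offset)
    start[-1] = np.log(0.1)
    # Assumption: log(phi) is boxed into a finite range for numerical stability.
    bounds = [(None, None)] * n_coef + [(-10.0, 5.0)]
    res = minimize(nb_negloglik, start, args=(y, X, offset),
                   method="L-BFGS-B", bounds=bounds)
    return res.x[:-1], float(np.exp(res.x[-1])), -float(res.fun)


def lrt_single_gene(y, X_full, X_reduced, offset):
    """First-order likelihood ratio test of the full vs. reduced design."""
    _, phi_hat, ll_full = fit_nb(y, X_full, offset)
    _, _, ll_reduced = fit_nb(y, X_reduced, offset)
    stat = max(2.0 * (ll_full - ll_reduced), 0.0)
    df = X_full.shape[1] - X_reduced.shape[1]
    return stat, float(chi2.sf(stat, df)), phi_hat
```

For a two-group comparison, `X_full` would hold an intercept column and a group indicator, and `X_reduced` the intercept alone; the returned tuple is the test statistic, its chi-square p-value, and the gene's own dispersion estimate under the full model.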
