4.7 Article

NEBULA is a fast negative binomial mixed model for differential or co-expression analysis of large- scale multi-subject single-cell data

Journal

COMMUNICATIONS BIOLOGY
Volume 4, Issue 1, Pages -

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s42003-021-02146-6

Keywords

-

Funding

  1. National Institute on Aging [R01 AG061853, R01 AG065477, R01 AG070488]
  2. National Institute on Health [U19 AI089992, R25 NS079193, P01 AI073748, U24 AI11867, R01 AI22220, UM 1HG009390, P01 AI039671, P50 CA121974, R01 CA227473, R01 AG058002, U01 MH119509, R01 MH109978, R01 AG067151, R01 AG062335, U01 NS110453, UG3 NS115064, RF1 AG054012, RF1 AG062377]
  3. NIA [P30AG10161, R01AG15819, R01AG17917, R01AG30146, R01AG36042, RC2AG036547, R01AG36836, R01AG48015, RF1AG57473, U01AG32984, U01AG46152, U01AG46161, U01AG61356]
  4. Illinois Department of Public Health (ROSMAP)
  5. Translational Genomics Research Institute

Ask authors/readers for more resources

The NEBULA model proposed in this study offers a more efficient way to analyze multi-subject single-cell data, demonstrating faster processing and improved control over false positives. Using this model on Alzheimer's disease cohort data revealed specific correlations between APOE and other genetic risk factors in a cell-type-specific pattern.
The increasing availability of single-cell data revolutionizes the understanding of biological mechanisms at cellular resolution. For differential expression analysis in multi-subject single-cell data, negative binomial mixed models account for both subject-level and cell-level overdispersions, but are computationally demanding. Here, we propose an efficient NEgative Binomial mixed model Using a Large-sample Approximation (NEBULA). The speed gain is achieved by analytically solving high-dimensional integrals instead of using the Laplace approximation. We demonstrate that NEBULA is orders of magnitude faster than existing tools and controls false-positive errors in marker gene identification and co-expression analysis. Using NEBULA in Alzheimer's disease cohort data sets, we found that the cell-level expression of APOE correlated with that of other genetic risk factors (including CLU, CST3, TREM2, C1q, and ITM2B) in a cell-type-specific pattern and an isoform-dependent manner in microglia. NEBULA opens up a new avenue for the broad application of mixed models to large-scale multi-subject single-cell data.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available