4.1 Article

Statistical methods for analysis of single-cell RNA-sequencing data

Journal

METHODSX
Volume 8, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.mex.2021.101580

Keywords

Zero inflated negative binomial model; Molecular capture model; Observed UMI count; True UMI count; Mean; Zero Inflation; Overdispersion

Funding

  1. Indian Council of Agricultural Research (ICAR), New Delhi, India [18(02)/2016-EQR/Edn]
  2. National Institutes of Health (NIH), USA [5P20GM113226, 1P42ES023716, 5P30GM127607-02, 1P20GM125504-01, 2U54HL120163, 1P20GM135004, 1R35ES0238373-01, 1R01ES029846, 1P30ES030283]
  3. Kentucky Council on Postsecondary Education grant, USA [PON2 415 190 0002934]

Ask authors/readers for more resources

This article introduces a novel statistical approach for analyzing scRNA-seq data, including cell type detection, estimation of cell capture rates, and gene specific model parameters estimation. The method takes into consideration the biological processes that lead to severe dropout events in observed UMI counts of genes, and is able to perform differential expression and other downstream analyses.
Single-cell RNA-sequencing (scRNA-seq) is a recent high-throughput genomic technology used to study the expression dynamics of genes at single-cell level. Analyzing the scRNA-seq data in presence of biological confounding factors including dropout events is a challenging task. Thus, this article presents a novel statistical approach for various analyses of the scRNA-seq Unique Molecular Identifier (UMI) counts data. The various analyses include modeling and fitting of observed UMI data, cell type detection, estimation of cell capture rates, estimation of gene specific model parameters, estimation of the sample mean and sample variance of the genes, etc. Besides, the developed approach is able to perform differential expression, and other downstream analyses that consider the molecular capture process in scRNA-seq data modeling. Here, the external spike-ins data can also be used in the approach for better results. The unique feature of the method is that it considers the biological process that leads to severe dropout events in modeling the observed UMI counts of genes. The differential expression analysis of observed scRNA-seq UMI counts data is performed after adjustment for cell capture rates. The statistical approach performs downstream differential zero inflation analysis, classification of influential genes, and selection of top marker genes. Cell auxiliaries including cell clusters and other cell variables (e.g., cell cycle, cell phase) are used to remove unwanted variation to perform statistical tests reliably. Published by Elsevier B.V.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.1
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available