4.7 Article

Normalization by distributional resampling of high throughput single-cell RNA-sequencing data

期刊

BIOINFORMATICS
卷 37, 期 22, 页码 4123-4128

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btab450

关键词

-

资金

  1. National Library of Medicine Bio-Data Science Training program [T32LM012413]
  2. National Institutes of Health grant [NIHGM102756]

向作者/读者索取更多资源

Dino is a normalization method based on a flexible negative-binomial mixture model of gene expression, which improves downstream analysis performance in various settings.
Motivation: Normalization to remove technical or experimental artifacts is critical in the analysis of single-cell RNA-sequencing experiments, even those for which unique molecular identifiers are available. The majority of methods for normalizing single-cell RNA-sequencing data adjust average expression for library size (LS), allowing the variance and other properties of the gene-specific expression distribution to be non-constant in LS. This often results in reduced power and increased false discoveries in downstream analyses, a problem which is exacerbated by the high proportion of zeros present in most datasets. Results: To address this, we present Dino, a normalization method based on a flexible negative-binomial mixture model of gene expression. As demonstrated in both simulated and case study datasets, by normalizing the entire gene expression distribution, Dino is robust to shallow sequencing, sample heterogeneity and varying zero proportions, leading to improved performance in downstream analyses in a number of settings.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据