4.5 Article

CIDR: Ultrafast and accurate clustering through imputation for single-cell RNA-seq data

Journal

GENOME BIOLOGY
Volume 18, Issue -, Pages -

Publisher

BMC
DOI: 10.1186/s13059-017-1188-0

Keywords

Single-cell; scRNA-seq; Dropout; Imputation; Dimensionality reduction; Clustering; Cell type

Funding

  1. New South Wales Ministry of Health
  2. Human Frontier Science Program [RGY0084/2014]
  3. National Health and Medical Research Council of Australia [1105271]
  4. National Heart Foundation of Australia
  5. Amazon Web Services (AWS) Cloud Credits for Research
  6. National Health and Medical Research Council of Australia [1105271] Funding Source: NHMRC

Ask authors/readers for more resources

Most existing dimensionality reduction and clustering packages for single-cell RNA-seq (scRNA-seq) data deal with dropouts by heavy modeling and computational machinery. Here, we introduce CIDR (Clustering through Imputation and Dimensionality Reduction), an ultrafast algorithm that uses a novel yet very simple implicit imputation approach to alleviate the impact of dropouts in scRNA-seq data in a principled manner. Using a range of simulated and real data, we show that CIDR improves the standard principal component analysis and outperforms the state-of-the-art methods, namely t-SNE, ZIFA, andRaceID, in terms of clustering accuracy. CIDR typically completes within seconds when processing a data set of hundreds of cells and minutes for a data set of thousands of cells. CIDR can be downloaded at https://github.com/VCCRI/CIDR.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available