4.5 Article

Robustness and applicability of transcription factor and pathway analysis tools on single-cell RNA-seq data

Journal

GENOME BIOLOGY
Volume 21, Issue 1, Pages -

Publisher

BMC
DOI: 10.1186/s13059-020-1949-z

Keywords

scRNA-seq; Functional analysis; Transcription factor analysis; Pathway analysis; Benchmark

Funding

  1. German Federal Ministry of Education and Research (BMBF) [FKZ: 031 L0049]
  2. NIH [U54-CA217377]
  3. Premium Postdoctoral Fellowship Program of the Hungarian Academy of Sciences
  4. Spanish Institute of Health Carlos III (ISCIII) [CP14/00229]
  5. Ministerio de Ciencia, Innovacion y Universidades (AEI/FEDER, UE) [SAF2017-89109-P]

Ask authors/readers for more resources

Background Many functional analysis tools have been developed to extract functional and mechanistic insight from bulk transcriptome data. With the advent of single-cell RNA sequencing (scRNA-seq), it is in principle possible to do such an analysis for single cells. However, scRNA-seq data has characteristics such as drop-out events and low library sizes. It is thus not clear if functional TF and pathway analysis tools established for bulk sequencing can be applied to scRNA-seq in a meaningful way. Results To address this question, we perform benchmark studies on simulated and real scRNA-seq data. We include the bulk-RNA tools PROGENy, GO enrichment, and DoRothEA that estimate pathway and transcription factor (TF) activities, respectively, and compare them against the tools SCENIC/AUCell and metaVIPER, designed for scRNA-seq. For the in silico study, we simulate single cells from TF/pathway perturbation bulk RNA-seq experiments. We complement the simulated data with real scRNA-seq data upon CRISPR-mediated knock-out. Our benchmarks on simulated and real data reveal comparable performance to the original bulk data. Additionally, we show that the TF and pathway activities preserve cell type-specific variability by analyzing a mixture sample sequenced with 13 scRNA-seq protocols. We also provide the benchmark data for further use by the community. Conclusions Our analyses suggest that bulk-based functional analysis tools that use manually curated footprint gene sets can be applied to scRNA-seq data, partially outperforming dedicated single-cell tools. Furthermore, we find that the performance of functional analysis tools is more sensitive to the gene sets than to the statistic used.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available