4.7 Article

CEDA: integrating gene expression data with CRISPR-pooled screen data identifies essential genes with higher expression

Journal

BIOINFORMATICS
Volume 38, Issue 23, Pages 5245-5252

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btac668

Keywords

-

Funding

  1. National Institutes of Health (NIH) [P30CA016058]

Ask authors/readers for more resources

This study developed a statistical method called CEDA, which integrates gene expression data and CRISPR screening data for identifying essential genes. Compared to existing methods, CEDA shows higher sensitivity in detecting essential genes with moderate sgRNA fold change.
Motivation Clustered regularly interspaced short palindromic repeats (CRISPR)-based genetic perturbation screen is a powerful tool to probe gene function. However, experimental noises, especially for the lowly expressed genes, need to be accounted for to maintain proper control of false positive rate. Methods We develop a statistical method, named CRISPR screen with Expression Data Analysis (CEDA), to integrate gene expression profiles and CRISPR screen data for identifying essential genes. CEDA stratifies genes based on expression level and adopts a three-component mixture model for the log-fold change of single-guide RNAs (sgRNAs). Empirical Bayesian prior and expectation-maximization algorithm are used for parameter estimation and false discovery rate inference. Results Taking advantage of gene expression data, CEDA identifies essential genes with higher expression. Compared to existing methods, CEDA shows comparable reliability but higher sensitivity in detecting essential genes with moderate sgRNA fold change. Therefore, using the same CRISPR data, CEDA generates an additional hit gene list. Supplementary information are available at Bioinformatics online.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available