4.7 Article

scESI: evolutionary sparse imputation for single-cell transcriptomes from nearest neighbor cells

Journal

BRIEFINGS IN BIOINFORMATICS
Volume 23, Issue 5, Pages -

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbac144

Keywords

single-cell RNA-seq; imputation; sparse representation; multiobjective evolutionary algorithm

Funding

  1. National Natural Science Foundation of China [62072095, 61771165]
  2. National Key R&D Program of China [2021YFC2100100]
  3. Innovation Project of State Key Laboratory of Tree Genetics and Breeding (NortheastForestryUniversity) [2019A04]

Ask authors/readers for more resources

In this study, the authors propose an evolutionary sparse imputation (ESI) algorithm for single-cell transcriptomes to address the dropout problem and reduce data noise in gene expression profiles. The ESI algorithm constructs a sparse representation model based on gene regulation relationships between cells and uses an optimization framework based on nondominated sorting genetics to iteratively search for the global optimal solution. The results show that scESI outperforms benchmark methods in simulated datasets and real scRNA-seq datasets, improving cell classification, trajectory reconstruction, and identification of differentially expressed genes.
The ubiquitous dropout problem in single-cell RNA sequencing technology causes a large amount of data noise in the gene expression profile. For this reason, we propose an evolutionary sparse imputation (ESI) algorithm for single-cell transcriptomes, which constructs a sparse representation model based on gene regulation relationships between cells. To solve this model, we design an optimization framework based on nondominated sorting genetics. This framework takes into account the topological relationship between cells and the variety of gene expression to iteratively search the global optimal solution, thereby learning the Pareto optimal cell-cell affinity matrix. Finally, we use the learned sparse relationship model between cells to improve data quality and reduce data noise. In simulated datasets, scESI performed significantly better than benchmark methods with various metrics. By applying scESI to real scRNA-seq datasets, we discovered scESI can not only further classify the cell types and separate cells in visualization successfully but also improve the performance in reconstructing trajectories differentiation and identifying differentially expressed genes. In addition, scESI successfully recovered the expression trends of marker genes in stem cell differentiation and can discover new cell types and putative pathways regulating biological processes.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available