4.8 Article

Cell type annotation of single-cell chromatin accessibility data via supervised Bayesian embedding

期刊

NATURE MACHINE INTELLIGENCE
卷 4, 期 2, 页码 116-126

出版社

NATURE PORTFOLIO
DOI: 10.1038/s42256-021-00432-w

关键词

-

资金

  1. National Key Research and Development Program of China [2021YFF1200902]
  2. National Natural Science Foundation of China [61873141, 61721003, 61573207, U1736210]
  3. Guoqiang Institute, Tsinghua University
  4. Tsinghua-Fuzhou Institute for Data Technology

向作者/读者索取更多资源

The authors propose a probabilistic generative model called EpiAnno to automatically annotate single-cell chromatin accessibility sequencing (scCAS) data. The model is validated on multiple datasets and demonstrates advantages in interpretable embedding and biological implications.
Recent advances in single-cell technologies have enabled the characterization of epigenomic heterogeneity at the cellular level. Computational methods for automatic cell type annotation are urgently needed given the exponential growth in the number of cells. In particular, annotation of single-cell chromatin accessibility sequencing (scCAS) data, which can capture the chromatin regulatory landscape that governs transcription in each cell type, has not been fully investigated. Here we propose EpiAnno, a probabilistic generative model integrated with a Bayesian neural network, to annotate scCAS data automatically in a supervised manner. We systematically validate the superior performance of EpiAnno for both intra- and inter-dataset annotation on various datasets. We further demonstrate the advantages of EpiAnno for interpretable embedding and biological implications via expression enrichment analysis, partitioned heritability analysis, enhancer identification, cis-coaccessibility analysis and pathway enrichment analysis. In addition, we show that EpiAnno has the potential to reveal cell type-specific motifs and facilitate scCAS data simulation. The investigation of single-cell epigenomics with technologies such as single-cell chromatin accessibility sequencing (scCAS) presents an opportunity to expand the understanding of gene regulation at the cellular level. The authors develop a probabilistic generative model to better characterize cell heterogeneity and accurately annotate the cell type of scCAS data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据