Journal
BIOCHIMICA ET BIOPHYSICA ACTA-GENERAL SUBJECTS
Volume 1867, Issue 6, Pages -
Publisher
ELSEVIER
DOI: 10.1016/j.bbagen.2023.130360
Keywords
scATAC-seq; Tensor decomposition; Large sparse matrices; UMAP; Single-cell applications
The TD method can process the large sparse matrices generated from scATAC-seq, providing UMAP embeddings consistent with tissue specificity and selecting genes associated with various biological enrichment terms and transcription factor targeting.
ATAC-seq is a powerful tool for measuring the landscape structure of a chromosome. scATAC-seq is a recent version of ATAC-seq performed at the single-cell level. The problem with scATAC-seq is data sparsity: most genomic sites are recorded as inaccessible. Here, tensor decomposition (TD) was used to fill in missing values. In this study, TD was applied to massive scATAC-seq datasets binned into intervals of approximately 200 bp, where the number of intervals can reach 13,627,618. Currently, no other methods can deal with such large sparse matrices. The proposed method could not only provide UMAP embeddings that coincide with tissue specificity, but also select genes associated with various biological enrichment terms and transcription factor targeting. This suggests that TD is a useful tool for processing the large sparse matrices generated from scATAC-seq.
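The idea of factorizing a large sparse accessibility matrix without ever densifying it can be illustrated with a minimal sketch. This is not the authors' method: it uses a truncated SVD from SciPy as a stand-in for tensor decomposition, and the toy data, the matrix sizes, and the helper name `embed_cells` are all hypothetical. The resulting low-dimensional cell coordinates are the kind of input that could then be passed to a UMAP implementation.

```python
import numpy as np
from scipy.sparse import csr_matrix
from scipy.sparse.linalg import svds


def embed_cells(accessibility, n_components=5):
    """Return (cells x n_components) coordinates and the singular values.

    `accessibility` is a sparse (genomic bins x cells) matrix.
    Hypothetical helper: truncated SVD stands in for tensor decomposition.
    """
    # svds operates on the sparse matrix directly, so it is never densified.
    _, s, vt = svds(accessibility, k=n_components)
    # The order returned by svds is not guaranteed; sort largest first.
    order = np.argsort(s)[::-1]
    s = s[order]
    return vt[order].T * s, s


# Hypothetical toy data: 2,000 genomic bins x 60 cells, about 1% nonzero.
rng = np.random.default_rng(0)
n_bins, n_cells, n_nonzero = 2000, 60, 1200
rows = rng.integers(0, n_bins, n_nonzero)
cols = rng.integers(0, n_cells, n_nonzero)
toy = csr_matrix((np.ones(n_nonzero), (rows, cols)), shape=(n_bins, n_cells))

embedding, singular_values = embed_cells(toy, n_components=5)
print(embedding.shape)
```

Only the nonzero entries are stored and multiplied, which is why this style of computation remains feasible when the number of bins grows into the millions.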