期刊
BIOINFORMATICS
卷 37, 期 19, 页码 3228-3234出版社
OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btab257
关键词
-
类别
资金
- Takeda Pharmaceuticals Company Limited
- F. Hoffman-La Roche Ltd
- National Institutes of Health through IRP NIMH [R01MH085542, R01MH093725, P50MH066392, P50MH080405, R01MH097276, RO1-MH-075916, P50M096891, P50MH084053S1, R37MH057881, AG02219, AG05138, MH06692, R01MH110921, R01MH109677, R01MH109897, U01MH103392, HHSN271201300031C]
- NIA [P30AG10161, R01AG15819, R01AG17917, R01AG30146, R01AG36042, RC2AG036547, R01AG36836, R01AG48015, RF1AG57473, U01AG32984, U01AG46152, U01AG46161, U01AG61356]
- Illinois Department of Public Health (ROSMAP)
- Translational Genomics Research Institute (genomic)
- National Institute of Mental Health (NIMH) [R37MH057881, R01MH123184, UO1NH122681]
- National Science Foundation [DMS-1553884, DMS-2015492]
Marker genes, primarily expressed in a single-cell type, are highly correlated in bulk data, allowing for their identification through correlation patterns. A new algorithm has been developed to detect these marker genes by combining published information and bulk transcriptome data, refining the list of known marker genes based on the correlation structure of the bulk data.
Motivation: Marker genes, defined as genes that are expressed primarily in a single-cell type, can be identified from the single-cell transcriptome; however, such data are not always available for the many uses of marker genes, such as deconvolution of bulk tissue. Marker genes for a cell type, however, are highly correlated in bulk data, because their expression levels depend primarily on the proportion of that cell type in the samples. Therefore, when many tissue samples are analyzed, it is possible to identify these marker genes from the correlation pattern. Results: To capitalize on this pattern, we develop a new algorithm to detect marker genes by combining published information about likely marker genes with bulk transcriptome data in the form of a semi-supervised algorithm. The algorithm then exploits the correlation structure of the bulk data to refine the published marker genes by adding or removing genes from the list.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据