期刊
2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE
卷 -, 期 -, 页码 168-172出版社
IEEE COMPUTER SOC
DOI: 10.1109/BIBM49941.2020.9313267
关键词
cap analysis gene expression; transcription start site; promoter region; deep neural network; gene expression
类别
资金
- National Science Foundation [2015838, 1661414]
- National Institute of Health [R15HGM123407]
- Direct For Biological Sciences
- Div Of Biological Infrastructure [1661414] Funding Source: National Science Foundation
- Div Of Biological Infrastructure
- Direct For Biological Sciences [2015838] Funding Source: National Science Foundation
Gene transcription start site (TSS) identification is important to understanding transcriptional gene regulation. Cap Analysis Gene Expression (CAGE) experiments have recently become common practice for direct measurement of TSSs. Currently, CAGE data available in public databases created unprecedented opportunities to study gene transcriptional initiation mechanisms under various cellular conditions. However, due to potential transcriptional noises inherent in CAGE data, in-silico methods are required to identify bonafide TSSs from noises further. Here we present a computational approach dlCAGE, an end-to-end deep neural network to identify TSSs from CAGE data. dlCAGE incorporate de-novo DNA regulatory motif features discovered by DeepBind model architecture, as well as existing sequence and structural features. Testing results of dlCAGE in several cell lines in comparison with current state-of-the-art approaches showed its superior performance and promise in TSS identification from CAGE experiments.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据