Journal
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS
Volume 20, Issue 2, Pages 1319-1326
Publisher
IEEE COMPUTER SOC
DOI: 10.1109/TCBB.2022.3199758
Keywords
Transcription factor prediction; PSSM; DNA binding protein; deep learning; neural network
Transcription factors (TFs) are DNA-binding proteins that regulate gene expression by binding to specific DNA sequences. Because experimental methods for identifying TFs are time-consuming and costly, computational methods have been developed to overcome these limitations. However, the accuracy of these tools can still be improved.
Transcription factors (TFs) are DNA-binding proteins involved in the regulation of gene expression. They exist in all organisms and activate or repress transcription by binding to specific DNA sequences. Traditionally, TFs have been identified by experimental methods that are time-consuming and costly. In recent years, various computational methods have been developed to identify TFs and overcome these limitations. However, there is room for further improvement in the predictive accuracy of these tools. We report here a novel computational tool, TFnet, that provides accurate and comprehensive TF predictions from protein sequences. The accuracy of these predictions is substantially better than that of existing TF predictors and methods. In particular, TFnet significantly outperforms comparable methods when sequence similarity to other known sequences in the database drops below 40%. Ablation tests reveal that the high predictive performance stems from the innovative ways in which TFnet derives sequence Position-Specific Scoring Matrices (PSSMs) and encodes inputs.