4.7 Article Proceedings Paper

Sequence-Based Prediction of Putative Transcription Factor Binding Sites in DNA Sequences of Any Length

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TCBB.2017.2773075

Keywords

Transcription factor binding site; DNA; support vector machine; protein-DNA interaction

Funding

  1. Basic Science Research Program through the National Research Foundation of Korea (NRF) - Ministry of Science, ICT & Future Planning [2015R1A1A3A04001243]
  2. Ministry of Education [2010-0020163]
  3. National Research Foundation of Korea [2015R1A1A3A04001243] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

Ask authors/readers for more resources

A transcription factor (TF) is a protein that regulates gene expression by binding to specific DNA sequences. Despite recent advances in experimental techniques for identifying transcription factor binding sites (TFBS) in DNA sequences, a large number of TFBS are to be unveiled in many species. Several computational methods developed for predicting TFBS in DNA are tissue- or species-specific methods, and therefore cannot be used without prior knowledge of tissue or species. Some computational methods are applicable to identifying TFBS in short DNA sequences only. In this paper, we propose a new learning method for predicting TFBS in DNA of any length using the composition, transition, and distribution of nucleotides and amino acids in DNA and TF sequences. In independent testing of the method on datasets that were not used in training the method, the accuracy and MCC were as high as 81.84 percent and 0.634, respectively. The proposed method can be a useful aid for selecting potential TFBS in a large amount of DNA sequences before conducting biochemical experiments to empirically determine TFBS.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available