4.4 Article

A Spectral Rotation Method with Triplet Periodicity Property for Planted Motif Finding Problems

期刊

出版社

BENTHAM SCIENCE PUBL LTD
DOI: 10.2174/1386207322666191129112433

关键词

Gene detection; motif finding; visualization method; fast algorithm; Fourier spectrums; planted motif finding problem

资金

  1. National Natural Science Foundation of China [61873280, 61672033, 61972416, 61672248]
  2. Key Research and Development Program of Shandong Province [2017GGX10147]
  3. Natural Science Foundation of Shandong Province [ZR2017MF004]
  4. Fundamental Research Funds for the Central Universities [18CX02152A]
  5. Talent introduction project of China University of Petroleum [2017010054]
  6. AEI/FEDER, Spain-EU [TIN2016-81079-R]
  7. Talento-Comunidad de Madrid [2016-T2/TIC-2024]
  8. MINECO AEI/FEDER, Spain-EU [TIN2016-81079-R]
  9. InGEMICS-CM Project [B2017/BMD-3691]

向作者/读者索取更多资源

Background: Genes are known as functional patterns in the genome and are presumed to have biological significance. They can indicate binding sites for transcription factors and they encode certain proteins. Finding genes from biological sequences is a major task in computational biology for unraveling the mechanisms of gene expression. Objective: Planted motif finding problems arc a class of mathematical models abstracted from the process of detecting genes from genome, in which a specific gene with a number of mutations is planted into a randomly generated background sequence, and then gene finding algorithms can be tested to check if the planted gene can be found in feasible time. Methods: In this work, a spectral rotation method based on triplet periodicity property is proposed to solve planted motif finding problems. Results: The proposed method gives significant tolerance of base mutations in genes. Specifically, genes having a number of substitutions can be detected from randomly generated background sequences. Experimental results on genomic data set from Saccharomyces cerevisiae reveal that genes can be visually distinguished. It is proposed that genes with about 50% mutations can be detected from randomly generated background sequences. Conclusion: It is found that with about 5 insertions or deletions, this method fails in finding the planted genes. For a particular case, if the deletion of bases is located at the beginning of the gene, that is, bases are not randomly deleted, then the tolerance of the method for base deletion is increased.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据