4.7 Article

Floating Search Methodology for Combining Classification Models for Site Recognition in DNA Sequences

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TCBB.2020.2974221

关键词

Biological cells; Bioinformatics; Genomics; Computational modeling; Support vector machines; Biological system modeling; Search problems; Site recognition; gene prediction; models combination

资金

  1. Spanish Ministry of Science and Innovation ts [TIN-201566108-P]
  2. Junta de Andalucia [P09-TIC-4623, P07TIC-2682]

向作者/读者索取更多资源

In this paper, a methodology for combining multiple sources of information to recognize functional sites is proposed, showing significant improvement over existing methods. The use of floating search challenges the standard assumption of using genomes that are not too close or too far from the human genome to enhance the recognition of functional sites.
Recognition of the functional sites of genes, such as translation initiation sites, donor and acceptor splice sites and stop codons, is a relevant part of many current problems in bioinformatics. The best approaches use sophisticated classifiers, such as support vector machines. However, with the rapid accumulation of sequence data, methods for combining many sources of evidence are necessary as it is unlikely that a single classifier can solve this problem with the best possible performance. A major issue is that the number of possible models to combine is large and the use of all of these models is impractical. In this paper we present a methodology for combining many sources of information to recognize any functional site using floating search, a powerful heuristics applicable when the cost of evaluating each solution is high. We present experiments on four functional sites in the human genome, which is used as the target genome, and use another 20 species as sources of evidence. The proposed methodology shows significant improvement over state-of-the-art methods. The results show an advantage of the proposed method and also challenge the standard assumption of using only genomes not very close and not very far from the human to improve the recognition of functional sites.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据