4.7 Article

MULTiPly: a novel multi-layer predictor for discovering general and specific types of promoters

Journal

BIOINFORMATICS
Volume 35, Issue 17, Pages 2957-2965

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btz016

Keywords

-

Funding

  1. Fundamental Research Funds for the Central Universities [3132016306, 3132018227]
  2. National Natural Science Foundation of Liaoning Province [20180550307]
  3. National Scholarship Fund of China
  4. National Health and Medical Research Council of Australia (NHMRC) [APP490989, APP1127948, APP1144652]
  5. Australian Research Council (ARC) [LP110200333, DP120104460]
  6. National Institute of Allergy and Infectious Diseases of the National Institutes of Health [R01 AI111965]
  7. Major Inter-Disciplinary Research (IDR) project - Monash University
  8. Collaborative Research Program of Institute for Chemical Research, Kyoto University [2018-28]
  9. Informatics Institute of the School of Medicine at UAB

Ask authors/readers for more resources

Motivation: Promoters are short DNA consensus sequences that are localized proximal to the transcription start sites of genes, allowing transcription initiation of particular genes. However, the precise prediction of promoters remains a challenging task because individual promoters often differ from the consensus at one or more positions. Results: In this study, we present a new multi-layer computational approach, called MULTiPly, for recognizing promoters and their specific types. MULTiPly took into account the sequences themselves, including both local information such as k-tuple nucleotide composition, dinucleotide-based auto covariance and global information of the entire samples based on bi-profile Bayes and k-nearest neighbour feature encodings. Specifically, the F-score feature selection method was applied to identify the best unique type of feature prediction results, in combination with other types of features that were subsequently added to further improve the prediction performance of MULTiPly. Benchmarking experiments on the benchmark dataset and comparisons with five state-of-the-art tools show that MULTiPly can achieve a better prediction performance on 5-fold cross-validation and jackknife tests. Moreover, the superiority of MULTiPly was also validated on a newly constructed independent test dataset. MULTiPly is expected to be used as a useful tool that will facilitate the discovery of both general and specific types of promoters in the post-genomic era.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available