4.6 Article

iterb-PPse: Identification of transcriptional terminators in bacterial by incorporating nucleotide properties into PseKNC

期刊

PLOS ONE
卷 15, 期 5, 页码 -

出版社

PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pone.0228479

关键词

-

资金

  1. National Natural Science Foundation of China [61762026, 61462018]
  2. Guangxi Natural Science Foundation [2017GXNSFAA198278, 2016GXNSFAA380043]
  3. Innovation Project of GUET Graduate Education [2018YJCX47, 2019YCXS056]
  4. Guangxi Colleges and Universities Key Laboratory of Intelligent Processing of Computer Images and Graphics [GIIP201502]
  5. Guangxi Key Laboratory of Trusted Software [kx201403]

向作者/读者索取更多资源

Terminator is a DNA sequence that gives the RNA polymerase the transcriptional termination signal. Identifying terminators correctly can optimize the genome annotation, more importantly, it has considerable application value in disease diagnosis and therapies. However, accurate prediction methods are deficient and in urgent need. Therefore, we proposed a prediction method iterb-PPse for terminators by incorporating 47 nucleotide properties into PseKNC-I and PseKNC-II and utilizing Extreme Gradient Boosting to predict terminators based on Escherichia coli and Bacillus subtilis. Combing with the preceding methods, we employed three new feature extraction methods K-pwm, Base-content, Nucleotidepro to formulate raw samples. The two-step method was applied to select features. When identifying terminators based on optimized features, we compared five single models as well as 16 ensemble models. As a result, the accuracy of our method on benchmark dataset achieved 99.88%, higher than the existing state-of-the-art predictor iTerm-PseKNC in 100 times five-fold cross-validation test. Its prediction accuracy for two independent datasets reached 94.24% and 99.45% respectively. For the convenience of users, we developed a software on the basis of iterb-PPse with the same name. The open software and source code of iterb-PPse are available at https://github.com/Sarahyouzi/iterb-PPse.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据