Article

PredPromoter-MF(2L): A Novel Approach of Promoter Prediction Based on Multi-source Feature Fusion and Deep Forest

Publisher

SPRINGER HEIDELBERG
DOI: 10.1007/s12539-022-00520-4

Keywords

Promoter; Machine learning; Deep learning; Feature fusion; Feature selection; Deep Forest

Funding

  1. National Natural Science Foundation of China [61972322]
  2. Natural Science Foundation of Shaanxi Province [2021JM-110]

In this study, we proposed a novel two-layer predictor, PredPromoter-MF(2L), based on multi-source feature fusion and ensemble learning, and demonstrated its superiority in promoter prediction compared to existing methods.
Promoters, short DNA sequences, play vital roles in initiating gene transcription. However, identifying promoters in a high-throughput manner with conventional experimental techniques remains a challenge. To this end, several computational predictors based on machine learning models have been developed, but their performance remains unsatisfactory. In this study, we proposed a novel two-layer predictor, called PredPromoter-MF(2L), based on multi-source feature fusion and ensemble learning. PredPromoter-MF(2L) was developed using deep features learned by a pre-trained deep learning network model together with sequence-derived features. XGBoost-based feature selection was applied to reduce the dimensionality of the fused features, and a cascade deep forest model was trained on the selected feature subset for promoter prediction. The results of both fivefold cross-validation and an independent test demonstrated that PredPromoter-MF(2L) outperformed state-of-the-art methods.
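
The abstract does not say which sequence-derived descriptors or which fusion scheme were used. As a minimal illustration only, the sketch below assumes normalized k-mer frequencies as one possible sequence-derived feature and plain concatenation as the fusion step. The function names `kmer_frequencies` and `fuse_features` and the placeholder deep-feature values are hypothetical and are not taken from the paper.

```python
from itertools import product


def kmer_frequencies(seq, k=2):
    """Return normalized k-mer frequencies of a DNA sequence.

    The output follows a fixed lexicographic k-mer order (AA, AC, ..., TT for k=2),
    so every sequence maps to a vector of length 4**k.
    """
    seq = seq.upper()
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    total = 0
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if window in counts:  # skip windows containing ambiguous bases such as N
            counts[window] += 1
            total += 1
    return [counts[m] / total if total else 0.0 for m in kmers]


def fuse_features(*feature_vectors):
    """Fuse several feature sources by simple concatenation (an assumed scheme)."""
    fused = []
    for vec in feature_vectors:
        fused.extend(vec)
    return fused


sequence = "TATAATGCGC"                         # toy sequence, not real data
seq_features = kmer_frequencies(sequence, k=2)  # 16 dinucleotide frequencies
deep_features = [0.12, -0.40, 0.88]             # placeholder for learned deep features
fused = fuse_features(seq_features, deep_features)
print(len(fused))  # 16 + 3 = 19
```

In a pipeline like the one the abstract describes, a fused vector of this kind would then go through feature selection before a classifier is trained on the retained columns.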
