4.7 Article

RFCM-PALM: In-Silico Prediction of S-Palmitoylation Sites in the Synaptic Proteins for Male/Female Mouse Data

期刊

出版社

MDPI
DOI: 10.3390/ijms22189901

关键词

S-palmitoylation; post-translational modifications; feature selection; genetic algorithm; random-forest; consensus; knock-out; amino acid index; propensity; synaptic protein

资金

  1. National Science Centre, Poland [2017/26/E/NZ4/00637, 2019/34/E/NZ4/00387]
  2. Department of Biotechnology, Government of India [BT/PR16356/BID/7/596/2016]

向作者/读者索取更多资源

The study introduces a consensus strategy for predicting palmitoylated cysteine sites on synaptic proteins, which significantly improves prediction performance in mouse datasets through efficient feature selection and a three-star quality consensus strategy.
S-palmitoylation is a reversible covalent post-translational modification of cysteine thiol side chain by palmitic acid. S-palmitoylation plays a critical role in a variety of biological processes and is engaged in several human diseases. Therefore, identifying specific sites of this modification is crucial for understanding their functional consequences in physiology and pathology. We present a random forest (RF) classifier-based consensus strategy (RFCM-PALM) for predicting the palmitoylated cysteine sites on synaptic proteins from male/female mouse data. To design the prediction model, we have introduced a heuristic strategy for selection of the optimum set of physicochemical features from the AAIndex dataset using (a) K-Best (KB) features, (b) genetic algorithm (GA), and (c) a union (UN) of KB and GA based features. Furthermore, decisions from best-trained models of the KB, GA, and UN-based classifiers are combined by designing a three-star quality consensus strategy to further refine and enhance the scores of the individual models. The experiment is carried out on three categorized synaptic protein datasets of a male mouse, female mouse, and combined (male + female), whereas in each group, weighted data is used as training, and knock-out is used as the hold-out set for performance evaluation and comparison. RFCM-PALM shows similar to 80% area under curve (AUC) score in all three categories of datasets and achieve 10% average accuracy (male-15%, female-15%, and combined-7%) improvements on the hold-out set compared to the state-of-the-art approaches. To summarize, our method with efficient feature selection and novel consensus strategy shows significant performance gains in the prediction of S-palmitoylation sites in mouse datasets.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据