4.5 Article

DeepPPThermo: A Deep Learning Framework for Predicting Protein Thermostability Combining Protein-Level and Amino Acid-Level Features

期刊

JOURNAL OF COMPUTATIONAL BIOLOGY
卷 -, 期 -, 页码 -

出版社

MARY ANN LIEBERT, INC
DOI: 10.1089/cmb.2023.0097

关键词

attention mechanism; Bi-LSTM; deep learning; doc2vec; thermophilic proteins

向作者/读者索取更多资源

In this study, a deep learning-based classifier called DeepPPThermo was proposed, which combines classical sequence features and deep learning representation features to classify thermophilic and mesophilic proteins. The model utilizes deep neural networks and bi-long short-term memory networks to extract hidden features, and applies local and global attention mechanisms to assign different importance to multiview features. Experimental results show that our model outperforms other machine learning algorithms and deep learning algorithms. Furthermore, the robustness of the model and the importance of each feature have been demonstrated.
Using wet experimental methods to discover new thermophilic proteins or improve protein thermostability is time-consuming and expensive. Machine learning methods have shown powerful performance in the study of protein thermostability in recent years. However, how to make full use of multiview sequence information to predict thermostability effectively is still a challenge. In this study, we proposed a deep learning-based classifier named DeepPPThermo that fuses features of classical sequence features and deep learning representation features for classifying thermophilic and mesophilic proteins. In this model, deep neural network (DNN) and bi-long short-term memory (Bi-LSTM) are used to mine hidden features. Furthermore, local attention and global attention mechanisms give different importance to multiview features. The fused features are fed to a fully connected network classifier to distinguish thermophilic and mesophilic proteins. Our model is comprehensively compared with advanced machine learning algorithms and deep learning algorithms, proving that our model performs better. We further compare the effects of removing different features on the classification results, demonstrating the importance of each feature and the robustness of the model. Our DeepPPThermo model can be further used to explore protein diversity, identify new thermophilic proteins, and guide directed mutations of mesophilic proteins.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据