4.6 Article

Predicting subcellular location of protein with evolution information and sequence-based deep learning

Journal

BMC BIOINFORMATICS
Volume 22, Issue SUPPL 10, Pages -

Publisher

BMC
DOI: 10.1186/s12859-021-04404-0

Keywords

Subcellular prediction; Protein sequence; Evolution information; Deep learning; Multiple label classification

Funding

  1. National Key R&D Program of China [2020YFA0908400, KQTD20200820113106007]
  2. National Natural Science Foundation of China [62072107]
  3. Natural Science Foundation of Fujian Province of China [2020J01610]

Ask authors/readers for more resources

This study introduces a deep learning-based method that utilizes both amino acid composition sequences and evolution matrices of proteins, significantly improving the accuracy of predicting protein subcellular locations.
Background Protein subcellular localization prediction plays an important role in biology research. Since traditional methods are laborious and time-consuming, many machine learning-based prediction methods have been proposed. However, most of the proposed methods ignore the evolution information of proteins. In order to improve the prediction accuracy, we present a deep learning-based method to predict protein subcellular locations. Results Our method utilizes not only amino acid compositions sequence but also evolution matrices of proteins. Our method uses a bidirectional long short-term memory network that processes the entire protein sequence and a convolutional neural network that extracts features from protein sequences. The position specific scoring matrix is used as a supplement to protein sequences. Our method was trained and tested on two benchmark datasets. The experiment results show that our method yields accurate results on the two datasets with an average precision of 0.7901, ranking loss of 0.0758 and coverage of 1.2848. Conclusion The experiment results show that our method outperforms five methods currently available. According to those experiments, we can see that our method is an acceptable alternative to predict protein subcellular location.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available