4.7 Article

Learning spatial structures of proteins improves protein-protein interaction prediction

Journal

BRIEFINGS IN BIOINFORMATICS
Volume 23, Issue 2, Pages -

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbab558

Keywords

protein-protein interaction; protein representation learning; graph neural network; multi-dimension feature confusion

Funding

  1. National Natural Science Foundation of China [62122025, 62102140, 61972138, 61872309]
  2. Hunan Provincial Natural Science Foundation of China [J10020, J4215]
  3. Key Research and Development Program of Changsha [kq2004016]
  4. Open Research Projects of Zhejiang Lab [2021RDOAB02]

Ask authors/readers for more resources

The spatial structures of proteins are important for their functions, but the limited quantity of known protein structures restricts their application in prediction methods. Utilizing predicted protein structure information can improve sequence-based prediction methods. TAGPPI is a novel framework that uses only protein sequences to predict protein-protein interactions and extracts spatial structure information from contact maps to improve prediction performance.
Spatial structures of proteins are closely related to protein functions. Integrating protein structures improves the performance of protein-protein interaction (PPI) prediction. However, the limited quantity of known protein structures restricts the application of structure-based prediction methods. Utilizing the predicted protein structure information is a promising method to improve the performance of sequence-based prediction methods. We propose a novel end-to-end framework, TAGPPI, to predict PPIs using protein sequence alone. TAGPPI extracts multi-dimensional features by employing 1D convolution operation on protein sequences and graph learning method on contact maps constructed from AlphaFold. A contact map contains abundant spatial structure information, which is difficult to obtain from 1D sequence data directly. We further demonstrate that the spatial information learned from contact maps improves the ability of TAGPPI in PPI prediction tasks. We compare the performance of TAGPPI with those of nine state-of-the-art sequence-based methods, and TAGPPI outperforms such methods in all metrics. To the best of our knowledge, this is the first method to use the predicted protein topology structure graph for sequence-based PPI prediction. More importantly, our proposed architecture could be extended to other prediction tasks related to proteins.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available