4.7 Article

Hierarchical deep learning for predicting GO annotations by integrating protein knowledge

Journal

BIOINFORMATICS
Volume 38, Issue 19, Pages 4488-4496

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btac536

Keywords

-

Funding

  1. ANPCyT [0007, 3384]
  2. UNL [115]
  3. UNER [6204]
  4. European Molecular Biology Laboratory
  5. CABANA project-BBSRC [BB/P027849/1]

Ask authors/readers for more resources

This paper proposes a novel deep learning model called DeeProtGO for predicting GO term annotations by integrating protein knowledge. The experiments show that the prediction quality improves when more protein knowledge is integrated. DeeProtGO is benchmarked against state-of-the-art methods on public datasets and demonstrates effective improvement in the prediction of GO annotations.
Motivation: Experimental testing and manual curation are the most precise ways for assigning Gene Ontology (GO) terms describing protein functions. However, they are expensive, time-consuming and cannot cope with the exponential growth of data generated by high-throughput sequencing methods. Hence, researchers need reliable computational systems to help fill the gap with automatic function prediction. The results of the last Critical Assessment of Function Annotation challenge revealed that GO-terms prediction remains a very challenging task. Recent developments on deep learning are significantly breaking out the frontiers leading to new knowledge in protein research thanks to the integration of data from multiple sources. However, deep models hitherto developed for functional prediction are mainly focused on sequence data and have not achieved breakthrough performances yet. Results: We propose DeeProtGO, a novel deep-learning model for predicting GO annotations by integrating protein knowledge. DeeProtGO was trained for solving 18 different prediction problems, defined by the three GO sub-ontologies, the type of proteins, and the taxonomic kingdom. Our experiments reported higher prediction quality when more protein knowledge is integrated. We also benchmarked DeeProtGO against state-of-the-art methods on public datasets, and showed it can effectively improve the prediction of GO annotations.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available