☆ 4.7 Review

Systematic evaluation of machine learning methods for identifying human-pathogen protein-protein interactions

BRIEFINGS IN BIOINFORMATICS (2021)

Journal

BRIEFINGS IN BIOINFORMATICS

Volume 22, Issue 3, Pages -

Publisher

OXFORD UNIV PRESS

DOI: 10.1093/bib/bbaa068

Keywords

bioinformatics; human-pathogen interactions; protein-protein interactions; systematic evaluation; sequential analysis; machine learning

Funding

University Global Partnership Network (UGPN)
National Health and Medical Research Council of Australia (NHMRC) [1144652]
Australian Research Council (ARC) [LP110200333, DP120104460]
National Institute of Allergy and Infectious Diseases of the National Institutes of Health [R01 AI111965]
Major Inter-Disciplinary Research (IDR) project - Monash University
Collaborative Research Program of Institute for Chemical Research, Kyoto University [2019-32]
National Health and Medical Research Council of Australia [1144652] Funding Source: NHMRC

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This paper systematically evaluates machine learning-based computational methods for human-bacterium protein-protein interactions (HB-PPIs). By reviewing publicly available databases of HP-PPIs, identifying bacterium pathogens, summarizing existing models, and evaluating the performance of machine learning models, valuable insights are provided for predicting HB-PPIs.

In recent years, high-throughput experimental techniques have significantly enhanced the accuracy and coverage of protein-protein interaction identification, including human-pathogen protein-protein interactions (HP-PPIs). Despite this progress, experimental methods are, in general, expensive in terms of both time and labour costs, especially considering that there are enormous amounts of potential protein-interacting partners. Developing computational methods to predict interactions between human and bacteria pathogen has thus become critical and meaningful, in both facilitating the detection of interactions and mining incomplete interaction maps. In this paper, we present a systematic evaluation of machine learning-based computational methods for human-bacterium protein-protein interactions (HB-PPIs). We first reviewed a vast number of publicly available databases of HP-PPIs and then critically evaluate the availability of these databases. Benefitting from its well-structured nature, we subsequently preprocess the data and identified six bacterium pathogens that could be used to study bacterium subjects in which a human was the host. Additionally, we thoroughly reviewed the literature on 'host-pathogen interactions' whereby existing models were summarized that we used to jointly study the impact of different feature representation algorithms and evaluate the performance of existing machine learning computational models. Owing to the abundance of sequence information and the limited scale of other protein-related information, we adopted the primary protocol from the literature and dedicated our analysis to a comprehensive assessment of sequence information and machine learning models. A systematic evaluation of machine learning models and a wide range of feature representation algorithms based on sequence information are presented as a comparison survey towards the prediction performance evaluation of HB-PPIs.

Systematic evaluation of machine learning methods for identifying human-pathogen protein-protein interactions

Journal

BRIEFINGS IN BIOINFORMATICS

Publisher

OXFORD UNIV PRESS

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Systematic evaluation of machine learning methods for identifying human-pathogen protein-protein interactions

Journal

BRIEFINGS IN BIOINFORMATICS

Publisher

OXFORD UNIV PRESS

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper