Article

Taxonomy-aware, sequence similarity ranking reliably predicts phage-host relationships

Journal

BMC Biology
Volume 19, Issue 1

Publisher

BMC
DOI: 10.1186/s12915-021-01146-6

Keywords

Phage-host prediction; Phage; Prokaryote; Bacteria; Virus; Genome sequence; Bioinformatics

Funding

  1. Polish National Science Centre [2018/31/D/NZ2/00108, 2017/25/B/NZ2/00187]
  2. National Centre for Research and Development (NCBR, Poland) [LIDER/5/0023/L-10/18/NCBR/2019]

Phirbo is an extension of BLAST that improves phage-host prediction, raising both precision and recall. It discriminates true virus-host relationships better than other host prediction tools.
Background: Characterizing phage-host interactions is critical to understanding the ecological role of both partners and to the effective isolation of phage therapeuticals. Unfortunately, experimental methods for studying these interactions are markedly slow, low-throughput, and unsuitable for phages or hosts that are difficult to maintain in laboratory conditions. A number of in silico methods have therefore emerged to predict prokaryotic hosts from viral sequences. One of the leading approaches applies BLAST to search for local similarities between viral and microbial genomes. However, this prediction method has three major limitations: (i) top-scoring sequences do not always point to the actual host; (ii) mosaic virus genomes may match many, typically related, bacteria; and (iii) viral and host sequences may diverge beyond the point where their relationship can be detected by a BLAST alignment.

Results: We created an extension to BLAST, named Phirbo, that improves host prediction quality beyond what is obtainable from standard BLAST searches. The tool combines information on sequence similarity and bacterial relatedness to predict phage-host interactions. Phirbo was evaluated on three benchmark sets of known virus-host pairs, and it improved precision and recall by 11-40 percentage points over currently available, state-of-the-art alignment-based, alignment-free, and machine-learning host prediction tools. Moreover, the discriminatory power of Phirbo for recognizing virus-host relationships surpassed that of other tools by at least 10 percentage points (area under the curve = 0.95), yielding a mean host prediction accuracy of 57% and 68% at the genus and family levels, respectively. Accuracy drops by 12 percentage points when only a fraction of the viral genome sequence (3 kb) is used. Finally, we provide insights into a repertoire of protein and ncRNA genes that are shared between phages and hosts and may be prone to horizontal transfer during infection.

Conclusions: Our results suggest that Phirbo is a simple and effective tool for predicting phage-host relationships.
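
As an illustration of the baseline approach described in the Background (not of Phirbo itself), the following minimal sketch ranks candidate hosts for each phage by their best BLAST bit score and reports the top-scoring one. It assumes BLAST tabular output with the default twelve columns (query ID first, subject ID second, bit score last), with phage sequences as queries and host genomes as subjects; the function names `rank_hosts` and `top_hit_host` are hypothetical and chosen only for this example.

```python
from collections import defaultdict


def rank_hosts(blast_tab_lines):
    """Rank candidate hosts per phage by their best bit score.

    Assumes default 12-column BLAST tabular lines:
    column 1 = phage (query) ID, column 2 = host (subject) ID,
    column 12 = bit score. Returns {phage: [(host, best_bitscore), ...]}
    sorted by descending bit score (ties broken by host ID).
    """
    best = defaultdict(dict)
    for line in blast_tab_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        phage, host, bitscore = fields[0], fields[1], float(fields[11])
        # keep only the highest-scoring alignment for each phage-host pair
        if bitscore > best[phage].get(host, 0.0):
            best[phage][host] = bitscore
    return {
        phage: sorted(hosts.items(), key=lambda kv: (-kv[1], kv[0]))
        for phage, hosts in best.items()
    }


def top_hit_host(ranking):
    """Naive prediction: the single top-scoring host for each phage."""
    return {phage: hosts[0][0] for phage, hosts in ranking.items()}
```

Taking only the first entry of each ranked list is exactly where limitations (i) and (ii) arise: the remaining entries, which often belong to related bacteria, are discarded even though they carry information about the likely host.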
