4.8 Article

Streamlining CRISPR spacer-based bacterial host predictions to decipher the viral dark matter

Journal

NUCLEIC ACIDS RESEARCH
Volume 49, Issue 6, Pages 3127-3138

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/nar/gkab133

Keywords

-

Funding

  1. Joint Programming Initiative 'Healthy Diet for a Healthy Life'
  2. Danish Agency for Science and Higher Education
  3. Canadian Institutes of Health Research (Team grant on Intestinal Microbiomics, Institute of Nutrition, Metabolism and Diabetes) [143924]
  4. Sentinel North program of Universite Laval
  5. Canada First Research Excellence Fund
  6. Fonds de Recherche du Quebec -Nature et Technologies [259257]
  7. Goran-Enhorning Graduate Student Research Award from the Canadian Allergy, Asthma and Immunology Foundation
  8. Fonds de Recherche du Quebec - Sante [36093]
  9. Novo Nordisk Foundation project grant in basic bioscience [NNF18OC0052965]
  10. Canada Research Chair in Medical Genomics [950-231574]
  11. Tier 1 Canada Research Chair in Bacteriophages [950-232136]
  12. Canada Research Chair in Bacteriophages [950-232136]

Ask authors/readers for more resources

A large number of new phages have been discovered through viral metagenomics, with their host prediction based on CRISPR spacers which represent past phage-bacteria interactions. A set of tools has been developed for predicting hosts of uncharacterized phages, utilizing a database of over 11 million spacers and a program for large viral datasets. Performance evaluations show promise for gut-virome characterization using this host prediction method.
Thousands of new phages have recently been discovered thanks to viral metagenomics. These phages are extremely diverse and their genome sequences often do not resemble any known phages. To appreciate their ecological impact, it is important to determine their bacterial hosts. CRISPR spacers can be used to predict hosts of unknown phages, as spacers represent biological records of past phage-bacteria interactions. However, no guidelines have been established to standardize host prediction based on CRISPR spacers. Additionally, there are no tools that use spacers to perform host predictions on large viral datasets. Here, we developed a set of tools that includes all the necessary steps for predicting the hosts of uncharacterized phages. We created a database of >11 million spacers and a program to execute host predictions on large viral datasets. Our host prediction approach uses biological criteria inspired by how CRISPR-Cas naturally work as adaptive immune systems, which make the results easy to interpret. We evaluated the performance using 9484 phages with known hosts and obtained a recall of 49% and a precision of 69%. We also found that this host prediction method yielded higher performance for phages that infect gut-associated bacteria, suggesting it is well suited for gut-virome characterization.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available