4.7 Review

Using metagenomic data to boost protein structure prediction and discovery

Journal

Publisher

ELSEVIER
DOI: 10.1016/j.csbj.2021.12.030

Keywords

Metagenomics; Multiple sequence alignment; Enzyme design; CRISPR-Cas system; Antibiotic resistance; Microbiome

Funding

  1. Young Scholars Program of Shandong University [21320082064101]
  2. National Natural Science Foundation of China [82071122, 81773547]
  3. National Key Research and Development Program of China [2020YFC2003500]
  4. Construction Engineering Special Fund of 'Taishan Scholars' of Shandong Province [tsqn201909180]
  5. Program of Excellent young scholars of Shandong University
  6. F.R.S.-FNRS Fund for Scientific Research

Ask authors/readers for more resources

This article reviews the application of metagenomic data in protein structure prediction and discovery. It introduces widely used metagenomic databases and analyzes how metagenomic data has contributed to the improvement of structure prediction methods. The article also discusses the role of metagenomes in the discovery of enzymes, new CRISPR-Cas systems, and antibiotic resistance genes.
Over the past decade, metagenomic sequencing approaches have been providing an ever-increasing amount of protein sequence data at an astonishing rate. These constitute an invaluable source of information which has been exploited in various research fields such as the study of the role of the gut microbiota in human diseases and aging. However, only a small fraction of all metagenomic sequences collected have been functionally or structurally characterized, leaving much of them completely unexplored. Here, we review how this information has been used in protein structure prediction and protein discovery. We begin by presenting some widely used metagenomic databases and analyze in detail how metagenomic data has contributed to the impressive improvement in the accuracy of structure prediction methods in recent years. We then examine how metagenomic information can be exploited to annotate protein sequences. More specifically, we focus on the role of metagenomes in the discovery of enzymes and new CRISPR-Cas systems, and in the identification of antibiotic resistance genes. With this review, we provide an overview of how metagenomic data is currently revolutionizing our understanding of protein science. (C) 2021 The Author(s). Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available