4.6 Review

Toward the solution of the protein structure prediction problem

期刊

JOURNAL OF BIOLOGICAL CHEMISTRY
卷 297, 期 1, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.jbc.2021.100870

关键词

-

资金

  1. National Institute of General Medical Sciences [GM136422, S10OD026825]
  2. National Institute of Allergy and Infectious Diseases [AI134678]
  3. National Science Foundation [IIS1901191, DBI2030790, MTM2025426]

向作者/读者索取更多资源

The protein structure prediction problem has been a key task in structural biology since 1973. Traditionally, template-based modeling (TBM) has been the most reliable approach for predicting protein structures, but recent research shows that end-to-end deep machine learning techniques can better solve this problem.
Since Anfinsen demonstrated that the information encoded in a protein's amino acid sequence determines its structure in 1973, solving the protein structure prediction problem has been the Holy Grail of structural biology. The goal of protein structure prediction approaches is to utilize computational modeling to determine the spatial location of every atom in a protein molecule starting from only its amino acid sequence. Depending on whether homologous structures can be found in the Protein Data Bank (PDB), structure prediction methods have been historically categorized as template-based modeling (TBM) or template-free modeling (FM) approaches. Until recently, TBM has been the most reliable approach to predicting protein structures, and in the absence of reliable templates, the modeling accuracy sharply declines. Nevertheless, the results of the most recent community-wide assessment of protein structure prediction experiment (CASP14) have demonstrated that the protein structure prediction problem can be largely solved through the use of end-to-end deep machine learning techniques, where correct folds could be built for nearly all single-domain proteins without using the PDB templates. Critically, the model quality exhibited little correlation with the quality of available template structures, as well as the number of sequence homologs detected for a given target protein. Thus, the implementation of deep-learning techniques has essentially broken through the 50-year-old modeling border between TBM and FM approaches and has made the success of high-resolution structure prediction significantly less dependent on template availability in the PDB library.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据