4.7 Review

Extending the Horizon of Homology Detection with Coevolution-based Structure Prediction

期刊

JOURNAL OF MOLECULAR BIOLOGY
卷 433, 期 20, 页码 -

出版社

ACADEMIC PRESS LTD- ELSEVIER SCIENCE LTD
DOI: 10.1016/j.jmb.2021.167106

关键词

remote homology; CFAP298; C21ORF59; DISC1; coevolution

资金

  1. Medical Research Council [MC_UU_00007/15]
  2. MRC [MC_UU_00007/15] Funding Source: UKRI

向作者/读者索取更多资源

This review discusses how co-evolution-based contact and distance prediction methods expand the horizons of homology detection, providing new functional insights and experimentally testable hypotheses. These methods reveal three-dimensional constraints among amino acids in protein sequences previously devoid of annotated domains and repeats, exposing hidden structures in featureless sequence landscapes. The algorithms have a revelatory impact akin to the use of ground-penetrating radar by archaeologists to discover long-hidden structures underground.
Traditional sequence analysis algorithms fail to identify distant homologies when they lie beyond a detection horizon. In this review, we discuss how co-evolution-based contact and distance prediction methods are pushing back this homology detection horizon, thereby yielding new functional insights and experimentally testable hypotheses. Based on correlated substitutions, these methods divine three-dimensional constraints among amino acids in protein sequences that were previously devoid of all annotated domains and repeats. The new algorithms discern hidden structure in an otherwise featureless sequence landscape. Their revelatory impact promises to be as profound as the use, by archaeologists, of ground-penetrating radar to discern long-hidden, subterranean structures. As examples of this, we describe how triplicated structures reflecting longin domains in MON1A-like proteins, or UVR-like repeats in DISC1, emerge from their predicted contact and distance maps. These methods also help to resolve structures that do not conform to a beads-on-a-string model of protein domains. In one such example, we describe CFAP298 whose ubiquitin-like domain was previously challenging to perceive owing to a large sequence insertion within it. More generally, the new algorithms permit an easier appreciation of domain families and folds whose evolution involved structural insertion or rearrangement. As we exemplify with alpha 1-antitrypsin, coevolution-based predicted contacts may also yield insights into protein dynamics and conformational change. This new combination of structure prediction (using innovative co-evolution based methods) and homology inference (using more traditional sequence analysis approaches) shows great promise for bringing into view a sea of evolutionary relationships that had hitherto lain far beyond the horizon of homology detection. (C) 2021 The Authors. Published by Elsevier Ltd.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据