☆ 4.7 Article

SeqCP: A sequence-based algorithm for searching circularly permuted proteins

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL (2023)

期刊

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL

卷 21, 期 -, 页码 185-201

出版社

ELSEVIER

DOI: 10.1016/j.csbj.2022.11.024

关键词

Circular permutation; Circular permutants; Protein sequence analysis; Protein structure modeling

类别

Biochemistry & Molecular Biology Biotechnology & Applied Microbiology

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Circular permutation (CP) is a protein sequence rearrangement that creates different positions for the termini of a protein along an imaginary circularized sequence. CP detection algorithms mainly rely on structural information, which limits their application to proteins with known structures. The development of a sequence-based CP search method is essential for identifying more CP pairs and advancing protein research.

Circular permutation (CP) is a protein sequence rearrangement in which the amino- and carboxyl-termini of a protein can be created in different positions along the imaginary circularized sequence. Circularly permutated proteins usually exhibit conserved three-dimensional structures and functions. By comparing the structures of circular permutants (CPMs), protein research and bioengineering applications can be approached in ways that are difficult to achieve by traditional mutagenesis. Most current CP detection algorithms depend on structural information. Because there is a vast number of proteins with unknown structures, many CP pairs may remain unidentified. An efficient sequence-based CP detector will help identify more CP pairs and advance many protein studies. For instance, some hypothetical proteins may have CPMs with known functions and structures that are informative for functional annotation, but existing structure-based CP search methods cannot be applied when those hypothetical proteins lack structural information. Despite the considerable potential for applications, sequence-based CP search methods have not been well developed. We present a sequence-based method, SeqCP, which analyzes normal and duplicated sequence alignments to identify CPMs and determine candidate CP sites for proteins. SeqCP was trained by data obtained from the Circular Permutation Database and tested with nonredundant datasets from the Protein Data Bank. It shows high reliability in CP identification and achieves an AUC of 0.9. SeqCP has been implemented into a web server available at: http://pcnas.life.nthu.edu.tw/ SeqCP/. (c) 2022 The Authors. Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

SeqCP: A sequence-based algorithm for searching circularly permuted proteins

期刊

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL

出版社

ELSEVIER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

SeqCP: A sequence-based algorithm for searching circularly permuted proteins

期刊

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL

出版社

ELSEVIER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文