4.8 Article

Dali server: structural unification of protein families

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Biochemistry & Molecular Biology

AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models

Mihaly Varadi et al.

Summary: AlphaFold DB is an openly accessible database with high-accuracy protein-structure predictions, powered by DeepMind's AlphaFold v2.0. It provides programmatic access to a vast number of predicted structures and is expanding to cover more sequences.

NUCLEIC ACIDS RESEARCH (2022)

Article Biochemistry & Molecular Biology

PANNZER-A practical tool for protein function prediction

Petri Toronen et al.

Summary: The advent of next-generation sequencing technology has resulted in a massive increase in gene catalogs for new genomes, transcriptomes, and metagenomes that require computational inference for functional annotation. PANNZER is a high-throughput functional annotation web server that supports annotation of up to 100,000 protein sequences and provides Gene Ontology annotations and free text description predictions. Two case studies highlight issues related to data quality and method evaluation, arguing that commonly used evaluation metrics and datasets may bias the development of automated function prediction methods.

PROTEIN SCIENCE (2022)

Article Biochemical Research Methods

The structural coverage of the human proteome before and after AlphaFold

Eduard Porta-Pardo et al.

Summary: The field of protein structure is undergoing a revolution, with advancements such as the AlphaFold database significantly improving our knowledge of human proteins. AlphaFold predictions enhance structural coverage and contribute to understanding important biomedical genes and mutations.

PLOS COMPUTATIONAL BIOLOGY (2022)

Article Biochemical Research Methods

Mining folded proteomes in the era of accurate structure prediction

Charles Bayly-Jones et al.

Summary: Protein structure plays a fundamental role in the function and processes of biological systems. Fold recognition algorithms provide a powerful tool to identify structural and functional similarities between distantly related homologs. With advances in machine learning techniques and a wealth of experimentally determined structures, previously curated sequence databases have become an important source of biological information. In this study, we use bioinformatic fold recognition algorithms to scan the entire AlphaFold structure database and identify novel protein family members, infer function, and group predicted protein structures. We identify novel, previously unknown members of various pore-forming protein families as an example of the utility of this approach.

PLOS COMPUTATIONAL BIOLOGY (2022)

Review Biochemistry & Molecular Biology

Type III CRISPR-Cas Systems: Deciphering the Most Complex Prokaryotic Immune System

Matvey V. Kolesnik et al.

Summary: The Type III CRISPR-Cas system, despite being among the most common, is one of the least investigated gene defense systems due to its complexity. This system recognizes and destroys foreign nucleic acids, with its effector complexes specifically targeting and cleaving RNA molecules.

BIOCHEMISTRY-MOSCOW (2021)

Article Biochemistry & Molecular Biology

CATH: increased structural coverage of functional space

Ian Sillitoe et al.

Summary: CATH identifies protein domains in structures and classifies them into evolutionary superfamilies, providing structural and functional annotations. The latest release significantly increases coverage of structural and sequence data, with additional derived data such as predicted sequence domains and functionally coherent sequence subsets. The FunFam generation pipeline has been re-engineered to capture more sequences with increased functional purity and information content.

NUCLEIC ACIDS RESEARCH (2021)

Article Multidisciplinary Sciences

Structure of SARS-CoV-2 ORF8, a rapidly evolving immune evasion protein

Thomas G. Flower et al.

Summary: The crystal structure of SARS-CoV-2 ORF8 was determined at 2.04-angstrom resolution by X-ray crystallography, revealing unique dimerization interfaces that may allow the protein to form large-scale assemblies, potentially mediating immune suppression and evasion activities.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2021)

Article Multidisciplinary Sciences

Structural characterization of the microbial enzyme urocanate reductase mediating imidazole propionate production

Raminta Venskutonyte et al.

Summary: This study reveals the association between the gut microbiota-produced imidazole propionate and type 2 diabetes, and provides important insights into the catalytic mechanism of the enzyme urocanate reductase through analysis of its crystal structures in four different states.

NATURE COMMUNICATIONS (2021)

Article Biochemistry & Molecular Biology

AlphaFold-Predicted Structures of KCTD Proteins Unravel Previously Undetected Relationships among the Members of the Family

Luciana Esposito et al.

Summary: One of the key features of KCTD proteins is their involvement in various physiological and pathological processes. Despite the lack of structural data, the use of predicted three-dimensional models by AlphaFold has enabled researchers to gain insights into the relationships within the KCTD family. A novel pseudo-phylogenetic tree based on a common structurally similar domain in the C-terminal region has revealed previously undetected similarities among the KCTD proteins, providing a solid foundation for understanding their diverse functions.

BIOMOLECULES (2021)

Article Multidisciplinary Sciences

Highly accurate protein structure prediction with AlphaFold

John Jumper et al.

Summary: Proteins are essential for life, and accurate prediction of their structures is a crucial research problem. Current experimental methods are time-consuming, highlighting the need for accurate computational approaches to address the gap in structural coverage. Despite recent progress, existing methods fall short of atomic accuracy in protein structure prediction.

NATURE (2021)

Article Multidisciplinary Sciences

Highly accurate protein structure prediction for the human proteome

Kathryn Tunyasuvunakool et al.

Summary: Using the AlphaFold method, the structural coverage of the proteome has been significantly expanded, covering 98.5% of human proteins with 58% of residues having confident predictions and 36% having very high confidence. Introducing new metrics to interpret the dataset and identify disordered regions, this study aims to provide high-quality predictions for generating biological hypotheses.

NATURE (2021)

Article Biochemistry & Molecular Biology

Topology evaluation of models for difficult targets in the 14th round of the critical assessment of protein structure prediction

Lisa N. Kinch et al.

Summary: This report summarizes the tertiary structure prediction assessment of difficult modeling targets in CASP14, with the top-performing AlphaFold2 group providing high quality models. Despite significant progress in protein structure prediction, challenges remain with flexible regions and obligate oligomeric assemblies. Performance-based PCA and heatmap clusters offer insight into target difficulties and successful structure prediction methodologies.

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS (2021)

Article Multidisciplinary Sciences

Accurate prediction of protein structures and interactions using a three-track neural network

Minkyung Baek et al.

Summary: Through the three-track network, we achieved accuracies approaching those of DeepMind in CASP14, enabling rapid solution of challenging x-ray crystallography and cryo-electron microscopy structure modeling problems, and providing insights into the functions of proteins with currently unknown structure.

SCIENCE (2021)

Article Biochemistry & Molecular Biology

Pfam: The protein families database in 2021

Jaina Mistry et al.

Summary: The Pfam database has recently added a large number of protein families and domains, made revisions for COVID-19 research, and introduced Pfam-B as a supplement. These updates and improvements can help researchers classify protein sequences more effectively and conduct related studies.

NUCLEIC ACIDS RESEARCH (2021)

Article Biochemistry & Molecular Biology

DALI and the persistence of protein shape

Liisa Holm

PROTEIN SCIENCE (2020)

Article Biochemistry & Molecular Biology

FATCAT 2.0: towards a better understanding of the structural diversity of proteins

Zhanwen Li et al.

NUCLEIC ACIDS RESEARCH (2020)

Article Biochemical Research Methods

The X-ray crystal structure of the N-terminal domain of Ssr4, a Schizosaccharomyces pombe chromatin-remodelling protein

Janet Newman et al.

ACTA CRYSTALLOGRAPHICA SECTION F-STRUCTURAL BIOLOGY COMMUNICATIONS (2020)

Article Biochemical Research Methods

Benchmarking fold detection by DaliLite v.5

Liisa Holm

BIOINFORMATICS (2019)

Article Biochemical Research Methods

HH-suite3 for fast remote homology detection and deep protein annotation

Martin Steinegger et al.

BMC BIOINFORMATICS (2019)

Article Biochemistry & Molecular Biology

SCOPe: classification of large macromolecular structures in the structural classification of proteinsextended database

John-Marc Chandonia et al.

NUCLEIC ACIDS RESEARCH (2019)

Article Biochemistry & Molecular Biology

ECOD: new developments in the evolutionary classification of domains

R. Dustin Schaeffer et al.

NUCLEIC ACIDS RESEARCH (2017)

Article Biochemistry & Molecular Biology

Dali server update

Liisa Holm et al.

NUCLEIC ACIDS RESEARCH (2016)

Article Biochemistry & Molecular Biology

SANSparallel: interactive homology search against Uniprot

Panu Somervuo et al.

NUCLEIC ACIDS RESEARCH (2015)

Article Biochemistry & Molecular Biology

MMDB and VAST+: tracking structural similarities between macromolecular complexes

Thomas Madej et al.

NUCLEIC ACIDS RESEARCH (2014)

Article Multidisciplinary Sciences

A formal test of the theory of universal common ancestry

Douglas L. Theobald

NATURE (2010)

Article Biochemical Research Methods

Searching protein structure databases with DaliLite v.3

L. Holm et al.

BIOINFORMATICS (2008)

Article Biochemistry & Molecular Biology

The natural history of the WRKY-GCM1 zinc fingers and the relationship between transcription factors and transposons

M. Madan Babu et al.

NUCLEIC ACIDS RESEARCH (2006)

Article Biochemistry & Molecular Biology

Structure of the conserved domain of ANAC, a member of the NAC family of transcription factors

HA Ernst et al.

EMBO REPORTS (2004)