4.7 Article

Highly significant improvement of protein sequence alignments with AlphaFold2

Related references

Note: Only part of the references are listed.
Article Biochemistry & Molecular Biology

AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models

Mihaly Varadi et al.

Summary: AlphaFold DB is an openly accessible database with high-accuracy protein-structure predictions, powered by DeepMind's AlphaFold v2.0. It provides programmatic access to a vast number of predicted structures and is expanding to cover more sequences.

NUCLEIC ACIDS RESEARCH (2022)

Article Biochemical Research Methods

The structural coverage of the human proteome before and after AlphaFold

Eduard Porta-Pardo et al.

Summary: The field of protein structure is undergoing a revolution, with advancements such as the AlphaFold database significantly improving our knowledge of human proteins. AlphaFold predictions enhance structural coverage and contribute to understanding important biomedical genes and mutations.

PLOS COMPUTATIONAL BIOLOGY (2022)

Article Biochemistry & Molecular Biology

A Comprehensive Phylogenetic Analysis of the Serpin Superfamily

Matthew A. Spence et al.

Summary: This study used a structural alignment of diverse serpins to generate a comprehensive 6,000-sequence phylogeny, showing extensive diversification of the superfamily into many novel functional clades. Analysis indicated that the hub proteins are ancient and similar due to convergent evolution, rather than horizontal gene transfer as previously speculated. This work clarifies longstanding questions in the evolution of serpins and provides new directions for research in the field of serpin biology.

MOLECULAR BIOLOGY AND EVOLUTION (2021)

Article Multidisciplinary Sciences

Highly accurate protein structure prediction with AlphaFold

John Jumper et al.

Summary: Proteins are essential for life, and accurate prediction of their structures is a crucial research problem. Current experimental methods are time-consuming, highlighting the need for accurate computational approaches to address the gap in structural coverage. Despite recent progress, existing methods fall short of atomic accuracy in protein structure prediction.

NATURE (2021)

Article Multidisciplinary Sciences

Highly accurate protein structure prediction for the human proteome

Kathryn Tunyasuvunakool et al.

Summary: Using the AlphaFold method, the structural coverage of the proteome has been significantly expanded, covering 98.5% of human proteins with 58% of residues having confident predictions and 36% having very high confidence. Introducing new metrics to interpret the dataset and identify disordered regions, this study aims to provide high-quality predictions for generating biological hypotheses.

NATURE (2021)

Article Biochemical Research Methods

A novel sequence alignment algorithm based on deep learning of the protein folding code

Mu Gao et al.

Summary: The SAdLSA algorithm effectively learns protein folding code from experimentally determined protein structures, improving structural relationships detection in sequence comparisons. It demonstrates significant improvement over established approaches on challenging datasets, with a time complexity of O(N) thanks to GPU acceleration.

BIOINFORMATICS (2021)

Article Biochemistry & Molecular Biology

Pfam: The protein families database in 2021

Jaina Mistry et al.

Summary: The Pfam database has recently added a large number of protein families and domains, made revisions for COVID-19 research, and introduced Pfam-B as a supplement. These updates and improvements can help researchers classify protein sequences more effectively and conduct related studies.

NUCLEIC ACIDS RESEARCH (2021)

Article Biochemical Research Methods

Protein multiple alignments: sequence-based versus structure-based programs

Mathilde Carpentier et al.

BIOINFORMATICS (2019)

Review Biochemistry & Molecular Biology

Critical assessment of methods of protein structure prediction (CASP)-Round XIII

Andriy Kryshtafovych et al.

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS (2019)

Article Biochemical Research Methods

mTM-align: an algorithm for fast and accurate multiple protein structure alignment

Runze Dong et al.

BIOINFORMATICS (2018)

Letter Biotechnology & Applied Microbiology

Nextflow enables reproducible computational workflows

Paolo Di Tommaso et al.

NATURE BIOTECHNOLOGY (2017)

Article Biochemistry & Molecular Biology

The Pfam protein families database: towards a more sustainable future

Robert D. Finn et al.

NUCLEIC ACIDS RESEARCH (2016)

Article Multidisciplinary Sciences

FAMSA: Fast and accurate multiple sequence alignment of huge protein families

Sebastian Deorowicz et al.

SCIENTIFIC REPORTS (2016)

Article Biochemistry & Molecular Biology

TCS: A New Multiple Sequence Alignment Reliability Measure to Estimate Alignment Accuracy and Improve Phylogenetic Tree Reconstruction

Jia-Ming Chang et al.

MOLECULAR BIOLOGY AND EVOLUTION (2014)

News Item Multidisciplinary Sciences

THE TOP 100 PAPERS

Richard Van Noorden et al.

NATURE (2014)

Article Biochemical Research Methods

CD-HIT: accelerated for clustering the next-generation sequencing data

Limin Fu et al.

BIOINFORMATICS (2012)

Article Biochemical Research Methods

Accurate multiple sequence alignment of transmembrane proteins with PSI-Coffee

Jia-Ming Chang et al.

BMC BIOINFORMATICS (2012)

Article Biotechnology & Applied Microbiology

Phylogenetic assessment of alignments reveals neglected tree signal in gaps

Christophe Dessimoz et al.

GENOME BIOLOGY (2010)

Review Biochemical Research Methods

Upcoming challenges for multiple sequence alignment methods in the high-throughput era

Carsten Kemena et al.

BIOINFORMATICS (2009)

Article Biochemical Research Methods

PROMALS: towards accurate multiple sequence alignments of distantly related proteins

Jimin Pei et al.

BIOINFORMATICS (2007)

Article Biochemical Research Methods

Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences

Weizhong Li et al.

BIOINFORMATICS (2006)

Article Biochemical Research Methods

The iRMSD: a local measure of sequence alignment accuracy using structural information

Fabrice Armougom et al.

BIOINFORMATICS (2006)

Article Biochemistry & Molecular Biology

MAFFT version 5: improvement in accuracy of multiple sequence alignment

K Katoh et al.

NUCLEIC ACIDS RESEARCH (2005)

Article Biochemistry & Molecular Biology

Scoring function for automated assessment of protein structure template quality

Y Zhang et al.

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS (2004)

Article Biochemistry & Molecular Biology

3DCoffee: Combining protein sequences and structures within multiple sequence alignments

O O'Sullivan et al.

JOURNAL OF MOLECULAR BIOLOGY (2004)

Article Biochemistry & Molecular Biology

T-Coffee: A novel method for fast and accurate multiple sequence alignment

C Notredame et al.

JOURNAL OF MOLECULAR BIOLOGY (2000)

Article Biochemistry & Molecular Biology

The Protein Data Bank

HM Berman et al.

NUCLEIC ACIDS RESEARCH (2000)