4.6 Article

IFF: Identifying key residues in intrinsically disordered regions of proteins using machine learning

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Biochemical Research Methods

ProteinBERT: a universal deep-learning model of protein sequence and function

Nadav Brandes et al.

Summary: Self-supervised deep language modeling has achieved unprecedented success with natural language tasks, and the authors introduce a new deep language model called ProteinBERT specifically designed for proteins, which efficiently handles long sequences and achieves near or even better performance than other methods, providing an effective framework for rapid training of protein predictors.

BIOINFORMATICS (2022)

Article Biochemistry & Molecular Biology

The return of the rings: Evolutionary convergence of aromatic residues in the intrinsically disordered regions of RNA-binding proteins for liquid-liquid phase separation

Wen-Lin Ho et al.

Summary: Aromatic residues are frequently found in the intrinsically disordered regions (IDRs) of RNA binding proteins (RBPs) and are often conserved among chordates. These residues play a crucial role in protein liquid-liquid phase separation (LLPS) and contribute to the formation of higher-order oligomers. This suggests that aromatic residues have played a significant role in the evolution of protein folding and biomolecular condensate assembly.

PROTEIN SCIENCE (2022)

Article Biology

Phase separation driven by interchangeable properties in the intrinsically disordered regions of protein paralogs

Shih-Hui Chiu et al.

Summary: The intrinsically disordered regions (IDRs) among paralogs have different physicochemical properties but converge on the same biophysical function. The diversity of IDRs can lead to different traits, but they ultimately evolve to achieve a converged function.

COMMUNICATIONS BIOLOGY (2022)

Article Biology

The dynamic properties of a nuclear coactivator binding domain are evolutionarily conserved

Elin Karlsson et al.

Summary: The evolution of proteins is constrained by their structure and function. A recent study on the Nuclear Coactivator Binding Domain (NCBD) from CREB-binding protein reveals that although it has increased thermodynamic stability during evolution, its dynamic biophysical properties have been maintained. These findings suggest that the dynamic properties of NCBD are important for its function.

COMMUNICATIONS BIOLOGY (2022)

Article Biochemical Research Methods

Discovering molecular features of intrinsically disordered regions by using evolution for contrastive learning

Alex X. Lu et al.

PLoS Computational Biology (2022)

Article Computer Science, Artificial Intelligence

ProtTrans: Toward Understanding the Language of Life Through Self-Supervised Learning

Ahmed Elnaggar et al.

Summary: Computational biology and bioinformatics provide valuable data for the development of language models in natural language processing. In this study, six different models were trained on protein sequence data and the resulting embeddings were used for various protein structure prediction tasks, demonstrating their advantages over traditional methods.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

Review Cell Biology

Biomolecular condensates at the nexus of cellular stress, protein aggregation disease and ageing

Simon Alberti et al.

Summary: Biomolecular condensates, formed through liquid-liquid phase separation in a tightly regulated manner, have fundamental roles in cellular organization and physiology. Recent studies provide insights into how cellular stress, ageing-related loss of homeostasis, and a decline in protein quality control may contribute to the formation of aberrant, disease-causing condensates.

NATURE REVIEWS MOLECULAR CELL BIOLOGY (2021)

Article Multidisciplinary Sciences

Learning the molecular grammar of protein condensates from sequence determinants and embeddings

Kadi L. Saar et al.

Summary: Research has shown that proteins prone to liquid-liquid phase separation are more disordered, less hydrophobic, and have lower Shannon entropy than other protein sequences. By using machine learning models and neural network language models, it is possible to predict and understand protein phase behavior effectively.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2021)

Article Multidisciplinary Sciences

Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences

Alexander Rives et al.

Summary: The deep contextual language model trained through unsupervised learning on protein sequences contains information about biological properties, has a multiscale structural organization, and can be used to improve predictions for protein mutational effects, secondary structure, and long-range contacts.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2021)

Review Biochemistry & Molecular Biology

On the Potential of Machine Learning to Examine the Relationship Between Sequence, Structure, Dynamics and Function of Intrinsically Disordered Proteins

Kresten Lindorff-Larsen et al.

Summary: Intrinsically disordered proteins (IDPs) and intrinsically disordered regions (IDRs) play important roles in a wide range of biological functions and reveal novel mechanisms of interactions. Computational methods, including machine learning, are crucial for predicting the structures and functions of IDPs and IDRs. Experiments provide insights into complexes and may enable more accurate predictions.

JOURNAL OF MOLECULAR BIOLOGY (2021)

Article Multidisciplinary Sciences

Highly accurate protein structure prediction with AlphaFold

John Jumper et al.

Summary: Proteins are essential for life, and accurate prediction of their structures is a crucial research problem. Current experimental methods are time-consuming, highlighting the need for accurate computational approaches to address the gap in structural coverage. Despite recent progress, existing methods fall short of atomic accuracy in protein structure prediction.

NATURE (2021)

Article Multidisciplinary Sciences

Accurate prediction of protein structures and interactions using a three-track neural network

Minkyung Baek et al.

Summary: Through the three-track network, we achieved accuracies approaching those of DeepMind in CASP14, enabling rapid solution of challenging x-ray crystallography and cryo-electron microscopy structure modeling problems, and providing insights into the functions of proteins with currently unknown structure.

SCIENCE (2021)

Article Biochemistry & Molecular Biology

OMA orthology in 2021: website overhaul, conserved isoforms, ancestral gene order and more

Adrian M. Altenhoff et al.

Summary: OMA is a resource that elucidates evolutionary relationships among 2326 genes, providing functions such as pairwise and groupwise orthologs and functional annotations. The updated OMA database has been reorganized into gene-, group-, and genome-centric pages, with new features and improvements added.

NUCLEIC ACIDS RESEARCH (2021)

Article Chemistry, Multidisciplinary

NMR Experiments for Studies of Dilute and Condensed Protein Phases: Application to the Phase-Separating Protein CAPRIN1

Leo E. Wong et al.

JOURNAL OF THE AMERICAN CHEMICAL SOCIETY (2020)

Article Biochemistry & Molecular Biology

Arginine-Enriched Mixed-Charge Domains Provide Cohesion for Nuclear Speckle Condensation

Jamie A. Greig et al.

MOLECULAR CELL (2020)

Article Multidisciplinary Sciences

TDP-43 α-helical structure tunes liquid-liquid phase separation and function

Alexander E. Conicella et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2020)

Article Multidisciplinary Sciences

Phosphoregulated FMRP phase separation models activity-dependent translation through bidirectional control of mRNA granule formation

Brian Tsang et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2019)

Article Multidisciplinary Sciences

Arabidopsis FLL2 promotes liquid-liquid phase separation of polyadenylation complexes

Xiaofeng Fang et al.

NATURE (2019)

Article Biochemistry & Molecular Biology

End-to-End Differentiable Learning of Protein Structure

Mohammed AlQuraishi

CELL SYSTEMS (2019)

Article Multidisciplinary Sciences

Distance-based protein folding powered by deep learning

Jinbo Xu

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2019)

Article Biochemistry & Molecular Biology

UniProt: a worldwide hub of protein knowledge

Alex Bateman et al.

NUCLEIC ACIDS RESEARCH (2019)

Article Biochemistry & Molecular Biology

The physical forces mediating self-association and phase-separation in the C-terminal domain of TDP-43

Hao-Ru Li et al.

BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS (2018)

Article Biochemistry & Molecular Biology

Phase Transitions in the Assembly and Function of Human miRISC

Jessica Sheu-Gruttadauria et al.

Article Biochemistry & Molecular Biology

TAR DNA-binding protein 43 (TDP-43) liquid-liquid phase separation is mediated by just a few aromatic residues

Hao-Ru Li et al.

JOURNAL OF BIOLOGICAL CHEMISTRY (2018)

Article Multidisciplinary Sciences

Self-interaction of NPM1 modulates multiple mechanisms of liquid-liquid phase separation

Diana M. Mitrea et al.

NATURE COMMUNICATIONS (2018)

Article Biochemistry & Molecular Biology

Intrinsically disordered sequences enable modulation of protein phase separation through distributed tyrosine motifs

Yuan Lin et al.

JOURNAL OF BIOLOGICAL CHEMISTRY (2017)

Review Cell Biology

Biomolecular condensates: organizers of cellular biochemistry

Salman F. Banani et al.

NATURE REVIEWS MOLECULAR CELL BIOLOGY (2017)

Article Multidisciplinary Sciences

Structural and hydrodynamic properties of an intrinsically disordered region of a germ cell-specific protein on phase separation

Jacob P. Brady et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2017)

Review Multidisciplinary Sciences

Liquid phase condensation in cell physiology and disease

Yongdae Shin et al.

SCIENCE (2017)

Review Genetics & Heredity

Causes of evolutionary rate variation among protein sites

Julian Echave et al.

NATURE REVIEWS GENETICS (2016)

Article Biochemistry & Molecular Biology

A Liquid-to-Solid Phase Transition of the ALS Protein FUS Accelerated by Disease Mutation

Avinash Patel et al.

Article Biochemistry & Molecular Biology

Residue-by-Residue View of In Vitro FUS Granules that Bind the C-Terminal Domain of RNA Polymerase II

Kathleen A. Burke et al.

MOLECULAR CELL (2015)

Article Biochemistry & Molecular Biology

Polyalanine expansions drive a shift into α-helical clusters without amyloid-fibril formation

Saskia Polling et al.

NATURE STRUCTURAL & MOLECULAR BIOLOGY (2015)

Article Biochemistry & Molecular Biology

Polyalanine expansions drive a shift into α-helical clusters without amyloid-fibril formation

Saskia Polling et al.

NATURE STRUCTURAL & MOLECULAR BIOLOGY (2015)

Article Biochemical Research Methods

PLAAC: a web and command-line application to identify proteins with prion-like amino acid composition

Alex K. Lancaster et al.

BIOINFORMATICS (2014)

Article Biochemistry & Molecular Biology

Protein Expansion Is Primarily due to Indels in Intrinsically Disordered Regions

Sara Light et al.

MOLECULAR BIOLOGY AND EVOLUTION (2013)

Article Multidisciplinary Sciences

A High-Coverage Genome Sequence from an Archaic Denisovan Individual

Matthias Meyer et al.

SCIENCE (2012)

Article Medicine, Research & Experimental

Evolution of Hemoglobin and Its Genes

Ross C. Hardison

COLD SPRING HARBOR PERSPECTIVES IN MEDICINE (2012)

Article Biochemistry & Molecular Biology

Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega

Fabian Sievers et al.

MOLECULAR SYSTEMS BIOLOGY (2011)

Article Multidisciplinary Sciences

Direct-coupling analysis of residue coevolution captures native contacts across many protein families

Faruck Morcos et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2011)

Article Biochemistry & Molecular Biology

Learning generative models for protein fold families

Sivaraman Balakrishnan et al.

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS (2011)

Review Biochemistry & Molecular Biology

Understanding protein non-folding

Vladimir N. Uversky et al.

BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS (2010)

Article Multidisciplinary Sciences

A Draft Sequence of the Neandertal Genome

Richard E. Green et al.

SCIENCE (2010)

Article Multidisciplinary Sciences

Identification of direct residue contacts in protein-protein interaction by message passing

Martin Weigt et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2009)

Review Biophysics

Intrinsic disorder and functional proteomics

Predrag Radivojac et al.

BIOPHYSICAL JOURNAL (2007)