4.8 Article

GeoBind: segmentation of nucleic acid binding interface on protein surface with geometric deep learning

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Biotechnology & Applied Microbiology

Large language models generate functional protein sequences across diverse families

Ali Madani et al.

Summary: A generative deep-learning model called ProGen is capable of designing artificial proteins with specific enzymatic activities. This model was trained on a large dataset of protein sequences and can generate protein sequences with predictable functions. ProGen can be further fine-tuned to improve the controllable generation of proteins from specific families, and artificial proteins generated by ProGen showed similar catalytic efficiencies to natural lysozymes.

NATURE BIOTECHNOLOGY (2023)

Article Biochemical Research Methods

ProteinBERT: a universal deep-learning model of protein sequence and function

Nadav Brandes et al.

Summary: Self-supervised deep language modeling has achieved unprecedented success with natural language tasks, and the authors introduce a new deep language model called ProteinBERT specifically designed for proteins, which efficiently handles long sequences and achieves near or even better performance than other methods, providing an effective framework for rapid training of protein predictors.

BIOINFORMATICS (2022)

Article Biochemical Research Methods

PST-PRNA: prediction of RNA-binding sites using protein surface topography and deep learning

Pengpai Li et al.

Summary: In this study, a novel model PST-PRNA is proposed for predicting RNA-binding sites based on protein surface topography. The model creates representative topography images of the entire protein surface using the 3D structural information of the protein and optimizes the potential features using deep learning models. Experimental results demonstrate that PST-PRNA successfully learns the latent structural information of protein surface and achieves excellent performance in testing.

BIOINFORMATICS (2022)

Article Biochemical Research Methods

ScanNet: an interpretable geometric deep learning model for structure-based protein binding site prediction

Jerome Tubiana et al.

Summary: This study introduces ScanNet, a geometric deep learning model that learns features directly from the 3D structure of proteins to predict their functional sites. By training on protein-protein and protein-antibody binding sites, ScanNet demonstrates accuracy and interpretable learned features. Additionally, ScanNet successfully predicts antigenic epitopes of the SARS-CoV-2 spike protein.

NATURE METHODS (2022)

Article Biochemistry & Molecular Biology

GraphBind: protein structural context embedded rules learned by hierarchical graph neural networks for recognizing nucleic-acid-binding residues

Ying Xia et al.

Summary: The paper proposes GraphBind, an accurate predictor for identifying nucleic-acid-binding residues on proteins based on an end-to-end graph neural network. By constructing graphs based on structural contexts and using hierarchical graph neural networks to embed local patterns, GraphBind demonstrates superior performance in binding residue recognition.

NUCLEIC ACIDS RESEARCH (2021)

Article Multidisciplinary Sciences

DeepRank: a deep learning framework for data mining 3D protein-protein interfaces

Nicolas Renaud et al.

Summary: DeepRank is a deep learning framework for data mining large sets of 3D protein-protein interfaces, enabling efficient training with millions of PPIs and supporting both classification and regression. By addressing challenges such as distinguishing biological versus crystallographic PPIs and ranking docking models, DeepRank proves to be competitive with or outperform state-of-the-art methods, demonstrating its versatility in structural biology research.

NATURE COMMUNICATIONS (2021)

Article Multidisciplinary Sciences

Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences

Alexander Rives et al.

Summary: The deep contextual language model trained through unsupervised learning on protein sequences contains information about biological properties, has a multiscale structural organization, and can be used to improve predictions for protein mutational effects, secondary structure, and long-range contacts.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2021)

Article Multidisciplinary Sciences

Highly accurate protein structure prediction with AlphaFold

John Jumper et al.

Summary: Proteins are essential for life, and accurate prediction of their structures is a crucial research problem. Current experimental methods are time-consuming, highlighting the need for accurate computational approaches to address the gap in structural coverage. Despite recent progress, existing methods fall short of atomic accuracy in protein structure prediction.

NATURE (2021)

Article Multidisciplinary Sciences

Accurate prediction of protein structures and interactions using a three-track neural network

Minkyung Baek et al.

Summary: Through the three-track network, we achieved accuracies approaching those of DeepMind in CASP14, enabling rapid solution of challenging x-ray crystallography and cryo-electron microscopy structure modeling problems, and providing insights into the functions of proteins with currently unknown structure.

SCIENCE (2021)

Review Biochemistry & Molecular Biology

How RNA-Binding Proteins Interact with RNA: Molecules and Mechanisms

Meredith Corley et al.

MOLECULAR CELL (2020)

Review Multidisciplinary Sciences

Array programming with NumPy

Charles R. Harris et al.

NATURE (2020)

Article Multidisciplinary Sciences

A deep learning framework to predict binding preference of RNA constituents on protein surface

Jordy Homing Lam et al.

NATURE COMMUNICATIONS (2019)

Article Biochemical Research Methods

Antibody interface prediction with 3D Zernike descriptors and SVM

Sebastian Daberdaku et al.

BIOINFORMATICS (2019)

Review Biochemistry & Molecular Biology

The Human Transcription Factors

Samuel A. Lambert et al.

Article Chemistry, Medicinal

ATPbind: Accurate Protein-ATP Binding Site Prediction by Combining Sequence-Profiling and Structure-Based Comparisons

Jun Hu et al.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2018)

Editorial Material Engineering, Electrical & Electronic

Geometric Deep Learning Going beyond Euclidean data

Michael M. Bronstein et al.

IEEE SIGNAL PROCESSING MAGAZINE (2017)

Article Biotechnology & Applied Microbiology

Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning

Babak Alipanahi et al.

NATURE BIOTECHNOLOGY (2015)

Review Cell Biology

Specificity and nonspecificity in RNA-protein interactions

Eckhard Jankowsky et al.

NATURE REVIEWS MOLECULAR CELL BIOLOGY (2015)

Article Biochemistry & Molecular Biology

Prediction of nucleic acid binding probability in proteins: a neighboring residue network based score

Zhichao Miao et al.

NUCLEIC ACIDS RESEARCH (2015)

Review Genetics & Heredity

A census of human RNA-binding proteins

Stefanie Gerstberger et al.

NATURE REVIEWS GENETICS (2014)

Article Biochemical Research Methods

Designing Template-Free Predictor for Targeting Protein-Ligand Binding Sites with Classifier Ensemble and Spatial Clustering

Dong-Jun Yu et al.

IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS (2013)

Article Biochemistry & Molecular Biology

BioLiP: a semi-manually curated database for biologically relevant ligand-protein interactions

Jianyi Yang et al.

NUCLEIC ACIDS RESEARCH (2013)

Article Biochemical Research Methods

CD-HIT: accelerated for clustering the next-generation sequencing data

Limin Fu et al.

BIOINFORMATICS (2012)

Article Biochemical Research Methods

HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment

Michael Remmert et al.

NATURE METHODS (2012)

Article Biochemistry & Molecular Biology

Reorganizing the protein space at the Universal Protein Resource (UniProt)

Rolf Apweiler et al.

NUCLEIC ACIDS RESEARCH (2012)

Article Biochemical Research Methods

Prediction of protein-RNA binding sites by a random forest method with combined features

Zhi-Ping Liu et al.

BIOINFORMATICS (2010)

Article Biochemical Research Methods

Biopython: freely available Python tools for computational molecular biology and bioinformatics

Peter J. A. Cock et al.

BIOINFORMATICS (2009)

Article Biochemistry & Molecular Biology

MM-align: a quick algorithm for aligning multiple-chain protein complex structures using iterative dynamic programming

Srayanta Mukherjee et al.

NUCLEIC ACIDS RESEARCH (2009)

Article Multidisciplinary Sciences

Fast screening of protein surfaces using geometric invariant fingerprints

Shuangye Yin et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2009)

Article Biochemistry & Molecular Biology

RNABindR: a server for analyzing and predicting RNA-binding sites in proteins

Michael Terribilini et al.

NUCLEIC ACIDS RESEARCH (2007)

Article Biochemistry & Molecular Biology

BindN: a web-based tool for efficient prediction of DNA and RNA binding sites in amino acid sequences

Liangjiang Wang et al.

NUCLEIC ACIDS RESEARCH (2006)

Article Multidisciplinary Sciences

Ultraconserved elements in the human genome

G Bejerano et al.

SCIENCE (2004)

Article Multidisciplinary Sciences

Genome-wide location and function of DNA binding proteins

B Ren et al.

SCIENCE (2000)

Article Biochemistry & Molecular Biology

The Protein Data Bank

HM Berman et al.

NUCLEIC ACIDS RESEARCH (2000)