4.8 Article

ProtTrans: Toward Understanding the Language of Life Through Self-Supervised Learning

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Genetics & Heredity

Embeddings from protein language models predict conservation and variant effects

Celine Marquet et al.

Summary: The study utilized Protein Language Models (pLMs) to predict sequence conservation and SAV effects without requiring multiple sequence alignments (MSAs). The results showed that embeddings alone could accurately predict residue conservation almost as effectively as ConSeq using MSAs.

HUMAN GENETICS (2022)

Article Multidisciplinary Sciences

Embeddings from deep learning transfer GO annotations beyond homology

Maria Littmann et al.

Summary: This study proposes a GO term prediction method based on SeqVec embedding and protein proximity, with promising results especially for proteins from smaller families or with intrinsically disordered regions.

SCIENTIFIC REPORTS (2021)

Article Biochemistry & Molecular Biology

PredictProtein - Predicting Protein Structure and Function for 29 Years

Michael Bernhofer et al.

Summary: PredictProtein has been a one-stop online resource for protein sequence analysis since 1992, providing various predictions including protein structure, function, and binding. Recently, new prediction methods such as deep learning embeddings and prediction of protein and residues binding DNA, RNA, or other proteins have been added to enhance its usability for computational and experimental biologists.

NUCLEIC ACIDS RESEARCH (2021)

Article Multidisciplinary Sciences

Improved protein structure prediction using predicted interresidue orientations

Jianyi Yang et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2020)

Article Biochemistry & Molecular Biology

NetSurfP-2.0: Improved prediction of protein structural features by integrated deep learning

Michael Schantz Klausen et al.

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS (2019)

Article Biochemistry & Molecular Biology

End-to-End Differentiable Learning of Protein Structure

Mohammed AlQuraishi

CELL SYSTEMS (2019)

Article Biochemical Research Methods

ProteinNet: a standardized data set for machine learning of protein structure

Mohammed AlQuraishi

BMC BIOINFORMATICS (2019)

Article Biochemical Research Methods

Protein-level assembly increases protein sequence recovery from metagenomic samples manyfold

Martin Steinegger et al.

NATURE METHODS (2019)

Article Biochemical Research Methods

Unified rational protein engineering with sequence-based deep representation learning

Ethan C. Alley et al.

NATURE METHODS (2019)

Article Biochemical Research Methods

Modeling aspects of the language of life through transfer-learning protein sequences

Michael Heinzinger et al.

BMC BIOINFORMATICS (2019)

Article Biochemistry & Molecular Biology

UniProt: a worldwide hub of protein knowledge

Alex Bateman et al.

NUCLEIC ACIDS RESEARCH (2019)

Article Biochemistry & Molecular Biology

SCOPe: classification of large macromolecular structures in the structural classification of proteinsextended database

John-Marc Chandonia et al.

NUCLEIC ACIDS RESEARCH (2019)

Article Biochemistry & Molecular Biology

Assessment of hard target modeling in CASP12 reveals an emerging role of alignment-based contact prediction methods

Luciano A. Abriata et al.

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS (2018)

Article Biochemistry & Molecular Biology

Evolutionary couplings and sequence variation effect predict protein binding sites

Maria Schelling et al.

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS (2018)

Article Multidisciplinary Sciences

Clustering huge protein sequence sets in linear time

Martin Steinegger et al.

NATURE COMMUNICATIONS (2018)

Article Biochemical Research Methods

Dark Proteins Important for Cellular Function

Andrea Schafferhans et al.

PROTEOMICS (2018)

Letter Biotechnology & Applied Microbiology

MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets

Martin Steinegger et al.

NATURE BIOTECHNOLOGY (2017)

Article Biochemical Research Methods

DeepLoc: prediction of protein subcellular localization using deep learning

Jose Juan Almagro Armenteros et al.

BIOINFORMATICS (2017)

Proceedings Paper Computer Science, Artificial Intelligence

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi et al.

44TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2017) (2017)

Article Biochemistry & Molecular Biology

Accurate contact predictions using covariation techniques and machine learning

Tomasz Kosciolek et al.

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS (2016)

Article Biochemistry & Molecular Biology

RaptorX-Property: a web server for protein structure property prediction

Sheng Wang et al.

NUCLEIC ACIDS RESEARCH (2016)

Article Biochemistry & Molecular Biology

TMSEG: Novel prediction of transmembrane helices

Michael Bernhofer et al.

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS (2016)

Article Multidisciplinary Sciences

Protein Secondary Structure Prediction Using Deep Convolutional Neural Fields

Sheng Wang et al.

SCIENTIFIC REPORTS (2016)

Article Biochemistry & Molecular Biology

JPred4: a protein secondary structure prediction server

Alexey Drozdetskiy et al.

NUCLEIC ACIDS RESEARCH (2015)

Article Multidisciplinary Sciences

Unexpected features of the dark proteome

Nelson Perdigao et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2015)

Article Biochemical Research Methods

UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches

Baris E. Suzek et al.

BIOINFORMATICS (2015)

Article Multidisciplinary Sciences

Continuous Distributed Representation of Biological Sequences for Deep Proteomics and Genomics

Ehsaneddin Asgari et al.

PLOS ONE (2015)

Article Biochemical Research Methods

LocTree2 predicts localization for all domains of life

Tatyana Goldberg et al.

BIOINFORMATICS (2012)

Article Biochemistry & Molecular Biology

Three-Dimensional Structures of Membrane Proteins from Genomic Sequencing

Thomas A. Hopf et al.

Article Multidisciplinary Sciences

Hydrophobic forces and the length limit of foldable protein domains

Milo M. Lin et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2012)

Article Multidisciplinary Sciences

Protein 3D Structure Computed from Evolutionary Sequence Variation

Debora S. Marks et al.

PLOS ONE (2011)

Article Biochemistry & Molecular Biology

PSI-2: Structural Genomics to Cover Protein Domain Family Space

Benoit H. Dessailly et al.

STRUCTURE (2009)

Article Biochemistry & Molecular Biology

Protein flexibility and intrinsic disorder

P Radivojac et al.

PROTEIN SCIENCE (2004)

Article Biochemical Research Methods

PISCES: a protein sequence culling server

GL Wang et al.

BIOINFORMATICS (2003)

Article Multidisciplinary Sciences

Enhanced protein domain discovery by using language modeling techniques from speech recognition

L Coin et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2003)

Article Biochemistry & Molecular Biology

The ENZYME database in 2000

A Bairoch

NUCLEIC ACIDS RESEARCH (2000)

Article Biochemistry & Molecular Biology

The Protein Data Bank

HM Berman et al.

NUCLEIC ACIDS RESEARCH (2000)