4.8 Article

DeepLoc 2.0: multi-label subcellular localization prediction using protein language models

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Biochemical Research Methods

ProteinBERT: a universal deep-learning model of protein sequence and function

Nadav Brandes et al.

Summary: Self-supervised deep language modeling has achieved unprecedented success with natural language tasks, and the authors introduce a new deep language model called ProteinBERT specifically designed for proteins, which efficiently handles long sequences and achieves near or even better performance than other methods, providing an effective framework for rapid training of protein predictors.

BIOINFORMATICS (2022)

Article Biotechnology & Applied Microbiology

SignalP 6.0 predicts all five types of signal peptides using protein language models

Felix Teufel et al.

Summary: Signal peptides are short amino acid sequences that regulate protein secretion and translocation. SignalP 6.0, a machine learning model, is introduced to detect all types of signal peptides, including those applicable to metagenomic data.

NATURE BIOTECHNOLOGY (2022)

Article Computer Science, Artificial Intelligence

ProtTrans: Toward Understanding the Language of Life Through Self-Supervised Learning

Ahmed Elnaggar et al.

Summary: Computational biology and bioinformatics provide valuable data for the development of language models in natural language processing. In this study, six different models were trained on protein sequence data and the resulting embeddings were used for various protein structure prediction tasks, demonstrating their advantages over traditional methods.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

Article Biology

Deep protein representations enable recombinant protein expression prediction

Hannah-Marie Martiny et al.

Summary: In the production of industrial enzymes, a crucial process is recombinant gene expression to induce overexpression of enzymes in a host microbe. A machine learning model specific to Bacillus subtilis has been developed to predict expressibility, using millions of unlabeled proteins and a small labeled dataset. The model shows modest performance but is sufficient for prioritizing expression candidates in high-throughput studies, capturing various features related to protein expression.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2021)

Article Multidisciplinary Sciences

Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences

Alexander Rives et al.

Summary: The deep contextual language model trained through unsupervised learning on protein sequences contains information about biological properties, has a multiscale structural organization, and can be used to improve predictions for protein mutational effects, secondary structure, and long-range contacts.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2021)

Article Biotechnology & Applied Microbiology

Prediction of GPI-anchored proteins with pointer neural networks

Magnus Halldor Gislason et al.

Summary: GPI anchors are crucial for linking proteins to the outer face of the plasma membrane in eukaryotic cells. Researchers have developed a new method, NetGPI, based on recurrent neural networks and attention mechanism, for predicting GPI anchoring signals. NetGPI outperforms existing methods in discriminating between GPI-anchored proteins and other secretory proteins.

CURRENT RESEARCH IN BIOTECHNOLOGY (2021)

Review Biochemistry & Molecular Biology

A Brief History of Protein Sorting Prediction

Henrik Nielsen et al.

PROTEIN JOURNAL (2019)

Article Biology

Detecting sequence signals in targeting peptides using deep learning

Jose Juan Almagro Armenteros et al.

LIFE SCIENCE ALLIANCE (2019)

Review Cell Biology

Subcellular Localization and Dynamics of the Bcl-2 Family of Proteins

Nikolay Popgeorgiev et al.

FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY (2018)

Article Multidisciplinary Sciences

A subcellular map of the human proteome

Peter J. Thul et al.

SCIENCE (2017)

Article Biochemical Research Methods

DeepLoc: prediction of protein subcellular localization using deep learning

Jose Juan Almagro Armenteros et al.

BIOINFORMATICS (2017)

Review Cardiac & Cardiovascular Systems

Protein sorting gone wrong-VPS10P domain receptors in cardiovascular and metabolic diseases

Vanessa Schmidt et al.

ATHEROSCLEROSIS (2016)

Article Biochemical Research Methods

Sparse regressions for predicting and interpreting subcellular localization of multi-label proteins

Shibiao Wan et al.

BMC BIOINFORMATICS (2016)

Article Biochemical Research Methods

UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches

Baris E. Suzek et al.

BIOINFORMATICS (2015)

Review Cell Biology

Protein Sorting at the trans-Golgi Network

Yusong Guo et al.

ANNUAL REVIEW OF CELL AND DEVELOPMENTAL BIOLOGY, VOL 30 (2014)

Article Multidisciplinary Sciences

The First Transmembrane Domain of Lipid Phosphatase SAC1 Promotes Golgi Localization

Jinzhi Wang et al.

PLOS ONE (2013)

Article Biochemical Research Methods

mGOASVM: Multi-label protein subcellular localization based on gene ontology and support vector machines

Shibiao Wan et al.

BMC BIOINFORMATICS (2012)

Article Biochemical Research Methods

Going from where to why-interpretable prediction of protein subcellular localization

Sebastian Briesemeister et al.

BIOINFORMATICS (2010)

Review Biotechnology & Applied Microbiology

Subcellular targeting strategies for drug design and delivery

Lawrence Rajendran et al.

NATURE REVIEWS DRUG DISCOVERY (2010)

Article Cell Biology

Lost in translation: the signal hypothesis

[Anonymous]

JOURNAL OF CELL BIOLOGY (2005)

Article Biochemistry & Molecular Biology

Multiple mechanisms regulate subcellular localization of human CDC6

LM Delmolino et al.

JOURNAL OF BIOLOGICAL CHEMISTRY (2001)