4.8 Article

PARROT is a flexible recurrent neural network framework for analysis of large protein datasets

期刊

ELIFE
卷 10, 期 -, 页码 -

出版社

eLIFE SCIENCES PUBL LTD
DOI: 10.7554/eLife.70576

关键词

machine learning; high-throughput methods; proteomics; bioinformatics; functional annotation; Human

类别

资金

  1. National Science Foundation [DGE-2139839]

向作者/读者索取更多资源

The rise of high-throughput experiments has led to the development of new computational approaches, with machine learning methods, particularly deep learning, being increasingly utilized. PARROT is a general framework for training and applying deep learning-based predictors on large protein datasets, demonstrating ease of use and applicability for a wide range of biological problems.
The rise of high-throughput experiments has transformed how scientists approach biological questions. The ubiquity of large-scale assays that can test thousands of samples in a day has necessitated the development of new computational approaches to interpret this data. Among these tools, machine learning approaches are increasingly being utilized due to their ability to infer complex nonlinear patterns from high-dimensional data. Despite their effectiveness, machine learning (and in particular deep learning) approaches are not always accessible or easy to implement for those with limited computational expertise. Here we present PARROT, a general framework for training and applying deep learning-based predictors on large protein datasets. Using an internal recurrent neural network architecture, PARROT is capable of tackling both classification and regression tasks while only requiring raw protein sequences as input. We showcase the potential uses of PARROT on three diverse machine learning tasks: predicting phosphorylation sites, predicting transcriptional activation function of peptides generated by high-throughput reporter assays, and predicting the fibrillization propensity of amyloid beta with data generated by deep mutational scanning. Through these examples, we demonstrate that PARROT is easy to use, performs comparably to state-of-the-art computational tools, and is applicable for a wide array of biological problems.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据