4.7 Article

Prosit Transformer: A transformer for Prediction of MS2 Spectrum Intensities

期刊

JOURNAL OF PROTEOME RESEARCH
卷 21, 期 5, 页码 1359-1364

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.jproteome.1c00870

关键词

Machine Learning; Proteomics; MS2 Spectra; Transformers

资金

  1. Swedish Foundation for Strategic Research [BD15-0043]
  2. European Union's Horizon 2020 Program [823839, H2020-INFRAIA-2018-1]
  3. Swedish Foundation for Strategic Research (SSF) [BD15-0043] Funding Source: Swedish Foundation for Strategic Research (SSF)

向作者/读者索取更多资源

Machine learning has long been essential in interpreting proteomics data from mass spectrometry. Recently, the Transformer model, successful in other fields of bioinformatics, has been applied with the convenience of transfer learning. In this study, a Transformer based on the TAPE model was implemented to predict MS2 intensities, outperforming the traditional recurrent neural network-based predictor Prosit.
Machine learning has been an integral part of interpreting data from mass spectrometry (MS)-based proteomics for a long time. Relatively recently, a machine-learning structure appeared successful in other areas of bioinformatics, Transformers. Furthermore, the implementation of Transformers within bioinformatics has become relatively convenient due to transfer learning, i.e., adapting a network trained for other tasks to new functionality. Transfer learning makes these relatively large networks more accessible as it generally requires less data, and the training time improves substantially. We implemented a Transformer based on the pretrained model TAPE to predict MS2 intensities. TAPE is a general model trained to predict missing residues from protein sequences. Despite being trained for a different task, we could modify its behavior by adding a prediction head at the end of the TAPE model and fine-tune it using the spectrum intensity from the training set to the well-known predictor Prosit. We demonstrate that the predictor, which we call Prosit Transformer, outperforms the recurrent neural-network-based predictor Prosit, increasing the median angular similarity on its holdout set from 0.908 to 0.929. We believe that Transformers will significantly increase prediction accuracy for other types of predictions within MS-based proteomics.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据