4.6 Article

Denovo-GCN: De Novo Peptide Sequencing by Graph Convolutional Neural Networks

Journal

APPLIED SCIENCES-BASEL
Volume 13, Issue 7, Pages -

Publisher

MDPI
DOI: 10.3390/app13074604

Keywords

tandem mass spectrum; de novo peptide sequencing; graph neural networks; spectrum graph

Ask authors/readers for more resources

This paper introduces a novel peptide sequencing method, Denovo-GCN, based on graph convolutional neural networks and convolutional neural networks. It can directly infer the peptide sequence from a tandem mass spectrum. By constructing an undirected graph, CNN is used to extract the features of the nodes, and the correlation between the nodes is represented by an adjacency matrix. Experiments show that Denovo-GCN outperforms DeepNovo with a relative improvement of 13.7-25.5% in terms of peptide-level recall.
Featured ApplicationProtein and peptide identification based on tandem mass spectrometry is a pillar technology in proteomics research. In recent years, increasing numbers of researchers have utilized deep learning to tackle challenges in proteomics. For example, catalyzed by deep learning, AlphaFold has achieved unparalleled levels of accuracy in protein-structure prediction. Prior to studying the structure and function of proteins in cells or tissues, it is essential to determine the sequences of amino acids in peptides or proteins. De novo peptide sequencing can be used to directly infer the peptide sequence from a tandem mass spectrum without the requirement for a reference sequence database, making it particularly suitable for the determination of protein sequences of unknown species, monoclonal antibodies, and cancer neoantigens.The de novo peptide-sequencing method can be used to directly infer the peptide sequence from a tandem mass spectrum. It has the advantage of not relying on protein databases and plays a key role in the determination of the protein sequences of unknown species, monoclonal antibodies, and cancer neoantigens. In this paper, we propose a method based on graph convolutional neural networks and convolutional neural networks, Denovo-GCN, for de novo peptide sequencing. We constructed an undirected graph based on the mass difference between the spectral peaks in a tandem mass spectrum. The features of the nodes on the spectrum graph, which represent the spectral peaks, were the matching information of the peptide sequence and the mass spectrum. Next, the Denovo-GCN used CNN to extract the features of the nodes. The correlation between the nodes was represented by an adjacency matrix, which aggregated the features of neighboring nodes. Denovo-GCN provides a complete end-to-end training and prediction framework to sequence patterns of peptides. Our experiments on various data sets from different species show that Denovo-GCN outperforms DeepNovo with a relative improvement of 13.7-25.5% in terms of the peptide-level recall.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available