Review

Identification of Single Amino Acid Substitutions in Proteogenomics

Journal

BIOCHEMISTRY-MOSCOW
Volume 83, Issue 3, Pages 250-258

Publisher

MAIK NAUKA/INTERPERIODICA/SPRINGER
DOI: 10.1134/S0006297918030057

Keywords

proteogenomics; proteomics; mass spectrometry; single amino acid polymorphism; single nucleotide polymorphism

Funding

  1. Russian Science Foundation [17-15-01229]

An important aim of proteogenomics, which combines data from high-throughput nucleic acid and protein analysis, is the reliable identification of single amino acid substitutions, a main type of coding genome variant. Exact knowledge of deviations from the consensus genome can be used in several biomedical fields, such as studies of the expression of mutated proteins in cancer, deciphering heterozygosity mechanisms, identification of neoantigens for anticancer vaccine production, and the search for RNA editing sites at the proteome level. Generating this knowledge requires processing large data arrays from high-resolution mass spectrometry, from which information on single-point protein variation is often difficult to extract. Accordingly, a significant problem in proteogenomic analysis is the high level of false positive identifications of variant-containing peptides. Here we review recently suggested approaches to processing high-quality proteomics data that may provide more reliable identification of single amino acid substitutions, especially in distinguishing them from residue modifications occurring in vitro and in vivo. Optimized methods for assessing the false discovery rate save the instrumental and computational time spent on validating interesting amino acid polymorphism findings by orthogonal methods.
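
One reason substitutions are hard to tell apart from residue modifications is that some of them produce the same mass shift. The sketch below is an illustration only, not a method taken from the reviewed paper. It lists single-residue substitutions whose monoisotopic mass difference coincides with a common modification. The function name `ambiguous_substitutions`, the 0.001 Da tolerance, and the choice of four modifications are assumptions made for this example; the residue masses and modification shifts are standard monoisotopic values rounded to five decimals.

```python
from itertools import permutations

# Standard monoisotopic residue masses in Da (rounded to 5 decimals).
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}

# Monoisotopic mass shifts of a few common modifications (illustrative subset).
MOD_SHIFT = {
    "methylation": 14.01565,
    "deamidation": 0.98402,
    "oxidation": 15.99491,
    "acetylation": 42.01056,
}


def ambiguous_substitutions(tol_da=0.001):
    """Return (original, new, modification, delta) tuples for substitutions
    whose mass difference matches a modification shift within tol_da.

    The comparison is by mass only: it ignores which residues a
    modification can chemically occur on.
    """
    hits = []
    for orig, new in permutations(RESIDUE_MASS, 2):
        delta = RESIDUE_MASS[new] - RESIDUE_MASS[orig]
        for mod, shift in MOD_SHIFT.items():
            if abs(delta - shift) <= tol_da:
                hits.append((orig, new, mod, round(delta, 5)))
    return hits


if __name__ == "__main__":
    for orig, new, mod, delta in ambiguous_substitutions():
        print(f"{orig}->{new}: {delta:+.5f} Da, same shift as {mod}")
```

For example, a Gly-to-Ala substitution adds the same mass as a methylation, and Asn-to-Asp adds the same mass as a deamidation, so precursor mass alone cannot separate these cases. Fragment-ion evidence for the affected site, or genomic support for the variant, would be needed.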
