期刊
JOURNAL OF PROTEOME RESEARCH
卷 17, 期 5, 页码 1801-1811出版社
AMER CHEMICAL SOC
DOI: 10.1021/acs.jproteome.7b00841
关键词
proteogenomics; melanoma; cell line; cancer genome; next-generation sequencing; shotgun proteomics; data integration; missense mutation
资金
- Russian Science Foundation [17-15-01229]
- Russian Foundation for Basic Research [16-54-21006]
The identification of genetically encoded variants at the proteome level is an important problem in cancer proteogenomics. The generation of customized protein databases from DNA or RNA sequencing data is a crucial stage of the identification workflow. Genomic data filtering applied at this stage may significantly modify variant search results, yet its effect is generally left out of the scope of proteogenomic studies. In this work, we focused on this impact using data of exome sequencing and LC-MS/MS analyses of six replicates for eight melanoma cell lines processed by a proteogenomics workflow. The main objectives were identifying variant peptides and revealing the role of the genomic data filtering in the variant identification. A series of six confidence thresholds for single nucleotide polymorphisms and indels from the exome data were applied to generate customized sequence databases of different stringency. In the searches against unfiltered databases, between 100 and 160 variant peptides were identified for each of the cell lines using X!Tandem and MS-GF+search engines. The recovery rate for variant peptides was similar to 1%, which is approximately three times lower than that of the wild-type peptides. Using unfiltered genomic databases for variant searches resulted in higher sensitivity and selectivity of the proteogenomic workflow and positively affected the ability to distinguish the cell lines based on variant peptide signatures.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据