期刊
ANALYTICAL CHEMISTRY
卷 86, 期 10, 页码 4758-4766出版社
AMER CHEMICAL SOC
DOI: 10.1021/ac4037679
关键词
-
资金
- Clayton Foundation
- Welch Foundation [F1515]
- Defense Advanced Research Projects Agency
- Defense Threat Reduction Agency
- National Institutes of Health (NIH) [5 RC1DA028779, GM 076536, DPI OD009572]
- Cancer Prevention and Research Institute of Texas
- NIH Western Research Center of Excellence in Biodefense (NIH) [5U54AI057156]
- Texas Institute for Drug and Diagnostics Development [TI-3D]
Characterizing the in vivo dynamics of the polyclonal antibody repertoire in serum, such as that which might arise in response to stimulation with an antigen, is difficult due to the presence of many highly similar immunoglobulin proteins, each specified by distinct B lymphocytes. These challenges have precluded the use of conventional mass spectrometry for antibody identification based on peptide mass spectral matches to a genomic reference database. Recently, progress has been made using bottom-up analysis of serum antibodies by nanoflow liquid chromatography/high-resolution tandem mass spectrometry combined with a sample-specific antibody sequence database generated by high-throughput sequencing of individual B cell immunoglobulin variable domains (V genes). Here, we describe how intrinsic features of antibody primary structure, most notably the interspersed segments of variable and conserved amino acid sequences, generate recurring patterns in the corresponding peptide mass spectra of V gene peptides, greatly complicating the assignment of correct sequences to mass spectral data. We show that the standard method of decoy-based error modeling fails to account for the error introduced by these highly similar sequences, leading to a significant underestimation of the false discovery rate. Because of these effects, antibody-derived peptide mass spectra require increased stringency in their interpretation. The use of filters based on the mean precursor ion mass accuracy of peptide-spectrum matches is shown to be particularly effective in distinguishing between true and false identifications. These findings highlight important caveats associated with the use of standard database search and error-modeling methods with nonstandard data sets and custom sequence databases.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据