4.5 Article

Sequence similarity-driven proteomics in organisms with unknown genomes by LC-MS/MS and automated de novo sequencing

Journal

PROTEOMICS
Volume 7, Issue 14, Pages 2318-2329

Publisher

WILEY
DOI: 10.1002/pmic.200700003

Keywords

de novo sequencing; LC-MS/MS; MS BLAST; organisms with unknown genomes; sequence-similarity searches

Funding

  1. NIGMS NIH HHS [1R01GM070986-01A1] Funding Source: Medline

Ask authors/readers for more resources

LC-MS/MS analysis on a linear ion trap LTQ mass spectrometer, combined with data processing, stringent, and sequence-similarity database searching tools, was employed in a layered manner to identify proteins in organisms with unsequenced genomes. Highly specific stringent searches (MASCOT) were applied as a first layer screen to identify either known (i.e. present in a database) proteins, or unknown proteins sharing identical peptides with related database sequences. Once the confidently matched spectra were removed, the remainder was filtered against a nonannotated library of background spectra that cleaned up the dataset from spectra of common protein and chemical contaminants. The rectified spectral dataset was further subjected to rapid batch de novo interpretation by PepNovo software, followed by the MS BLAST sequence-similarity search that used multiple redundant and partially accurate candidate peptide sequences. Importantly, a single dataset was acquired at the uncompromised sensitivity with no need of manual selection of MS/MS spectra for subsequent de novo interpretation. This approach enabled a completely automated identification of novel proteins that were, otherwise, missed by conventional database searches.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available