4.7 Article

Use of shotgun proteomics for the identification, confirmation, and correction of C. elegans gene annotations

Journal

GENOME RESEARCH
Volume 18, Issue 10, Pages 1660-1669

Publisher

COLD SPRING HARBOR LAB PRESS, PUBLICATIONS DEPT
DOI: 10.1101/gr.077644.108

Keywords

-

Funding

  1. National Institutes of Health [R21-GM074787, R01-DK069386, P41-RR011823]
  2. Howard Hughes Medical Institute
  3. Medical Research Council [G0701197] Funding Source: researchfish
  4. MRC [G0701197] Funding Source: UKRI

Ask authors/readers for more resources

We describe a general mass spectrometry- based approach for gene annotation of any organism and demonstrate its effectiveness using the nematode Caenorhabditis elegans. We detected 6779 C. elegans proteins (67,047 peptides), including 384 that, although annotated in WormBase WS150, lacked cDNA or other prior experimental support. We also identified 429 new coding sequences that were unannotated in WS150. Nearly half (192/ 429) of the new coding sequences were confirmed with RT- PCR data. Thirty- three (similar to 8%) of the new coding sequences had been predicted to be pseudogenes, 151 (similar to 35%) reveal apparent errors in gene models, and 245 (57%) appear to be novel genes. In addition, we verified 6010 exon - exon splice junctions within existing WormBase gene models. Our work confirms that mass spectrometry is a powerful experimental tool for annotating sequenced genomes. In addition, the collection of identified peptides should facilitate future proteomics experiments targeted at specific proteins of interest.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available