期刊
JOURNAL OF PROTEOME RESEARCH
卷 14, 期 4, 页码 1880-1887出版社
AMER CHEMICAL SOC
DOI: 10.1021/pr501286b
关键词
Large-scale proteomics; RNAseq; Alternative splicing; Dominant isoforms; Protein structure; Protein function
资金
- National Institutes of Health (NIH) [U41 HG007234]
- Spanish Ministry of Economics and Competiveness [BIO2012-40205, BIO2012-37926, RD07-0067-0014-COM-BIOMED, RETICS-RD12-0042-0056, PRB2-ProteoRed-PT13/0001/0017]
- EU-FP7 Project BLUEPRINT [282510]
- Spanish National Institute of Bioinformatics
Although eukaryotic cells express a wide range of alternatively spliced transcripts, it is not clear whether genes tend to express a range of transcripts simultaneously across cells, or produce dominant isoforms in a manner that is either tissue-specific or regardless of tissue. To date, large-scale investigations into the pattern of transcript expression across distinct tissues have produced contradictory results. Here, we attempt to determine whether genes express a dominant splice variant at the protein level. We interrogate peptides from eight large-scale human proteomics experiments and databases and find that there is a single dominant protein isoform, irrespective of tissue or cell type, for the vast majority of the protein-coding genes in these experiments, in partial agreement with the conclusions from the most recent large-scale RNAseq study. Remarkably, the dominant isoforms from the experimental proteomics analyses coincided overwhelmingly with the reference isoforms selected by two completely orthogonal sources, the consensus coding sequence variants, which are agreed upon by separate manual genome curation teams, and the principal isoforms from the APPRIS database, predicted automatically from the conservation of protein sequence, structure, and function.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据