4.7 Article

Genome-Wide Functional Annotation of Human Protein-Coding Splice Variants Using Multiple Instance Learning

期刊

JOURNAL OF PROTEOME RESEARCH
卷 15, 期 6, 页码 1747-1753

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.jproteome.5b00883

关键词

alternative splicing; protein-coding splice variant (PCSV); functional annotation; gene ontology (GO); multiple instance learning (MIL); support vector machine (SVM); RNA-seq; ADAM15; LMNA/C; DMXL2; IsoFunc

资金

  1. NSF [1452656]
  2. National Institutes of Health [1R21NS082212-01, U54ES017885]

向作者/读者索取更多资源

The vast majority of human multiexon genes undergo alternative splicing and produce a variety of splice variant transcripts and proteins, which can perform different functions. These protein-coding splice variants (PCSVs) greatly increase the functional diversity of proteins. Most functional annotation algorithms have been developed at the gene level; the lack of;isoform-level gold standards is an important intellectual limitation for currently available machine learning algorithms. The accumulation of a large amount of RNA-seq data in the public domain greatly increases our ability to examine the functional annotation of genes at isoform level. In the present study, we used a multiple instance learning (MIL)-based approach for predicting the function of PCSVs. We used transcript-level expression values and gene level functional associations from the Gene Ontology database. A support vector machine (SVM)-based 5-fold cross-validation technique was applied. Comparatively, genes with multiple PCSVs performed better than single PCSV genes, and performance also improved when more examples were available to train the models. We demonstrated our predictions using literature evidence of ADAM15, LMNA/C, and DMXL2 genes. All predictions have been implemented in a web resource called IsoFunc, which is freely available for the global scientific community through http://guanlab.ccmb.med.umich.edu/isofunc.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据