4.7 Article

Systematic assessment of pathway databases, based on a diverse collection of user-submitted experiments

期刊

BRIEFINGS IN BIOINFORMATICS
卷 23, 期 5, 页码 -

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbac355

关键词

gene set enrichment; pathways; Gene Ontology; benchmark; STRING; functional annotation

资金

  1. Swiss National Science Foundation [310030_192569]
  2. Alexion Pharmaceuticals Inc.
  3. University of Zurich
  4. Swiss National Science Foundation (SNF) [310030_192569] Funding Source: Swiss National Science Foundation (SNF)

向作者/读者索取更多资源

A knowledge-based grouping of genes into pathways or functional units is crucial for understanding cellular complexity. This study evaluates and compares existing and novel functional classification systems in terms of their discovery power and generality in gene set enrichment testing. The results show structural and performance differences between classification systems, with well-established, hierarchically organized pathway annotation systems performing best overall. On the other hand, recent unsupervised annotation systems excel in understudied areas and organisms, detecting more specific pathways.
A knowledge-based grouping of genes into pathways or functional units is essential for describing and understanding cellular complexity. However, it is not always clear a priori how and at what level of specificity functionally interconnected genes should be partitioned into pathways, for a given application. Here, we assess and compare nine existing and two conceptually novel functional classification systems, with respect to their discovery power and generality in gene set enrichment testing. We base our assessment on a collection of nearly 2000 functional genomics datasets provided by users of the STRING database. With these real-life and diverse queries, we assess which systems typically provide the most specific and complete enrichment results. We find many structural and performance differences between classification systems. Overall, the well-established, hierarchically organized pathway annotation systems yield the best enrichment performance, despite covering substantial parts of the human genome in general terms only. On the other hand, the more recent unsupervised annotation systems perform strongest in understudied areas and organisms, and in detecting more specific pathways, albeit with less informative labels.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据