4.7 Article

Enhancing knowledge discovery from cancer genomics data with Galaxy

期刊

GIGASCIENCE
卷 6, 期 5, 页码 -

出版社

OXFORD UNIV PRESS
DOI: 10.1093/gigascience/gix015

关键词

Lymphoma; Driver; Cancer; Genome; Pipeline; Workflow; Tool; Cloud

资金

  1. Carlos Slim Health Institute
  2. Genome Canada and Genome British Columbia [173CIC]
  3. CIHR
  4. Canadian Institutes for Health Research
  5. Terry Fox Research Institute
  6. Mitacs
  7. Amazon AWS research grant

向作者/读者索取更多资源

The field of cancer genomics has demonstrated the power of massively parallel sequencing techniques to inform on the genes and specific alterations that drive tumor onset and progression. Although large comprehensive sequence data sets continue to be made increasingly available, data analysis remains an ongoing challenge, particularly for laboratories lacking dedicated resources and bioinformatics expertise. To address this, we have produced a collection of Galaxy tools that represent many popular algorithms for detecting somatic genetic alterations from cancer genome and exome data. We developed new methods for parallelization of these tools within Galaxy to accelerate runtime and have demonstrated their usability and summarized their runtimes on multiple cloud service providers. Some tools represent extensions or refinement of existing toolkits to yield visualizations suited to cohort-wide cancer genomic analysis. For example, we present Oncocircos and Oncoprintplus, which generate data-rich summaries of exome-derived somatic mutation. Workflows that integrate these to achieve data integration and visualizations are demonstrated on a cohort of 96 diffuse large B-cell lymphomas and enabled the discovery of multiple candidate lymphoma-related genes. Our toolkit is available from our GitHub repository as Galaxy tool and dependency definitions and has been deployed using virtualization on multiple platforms including Docker.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据