4.7 Article

St. Jude Cloud: A Pediatric Cancer Genomic Data-Sharing Ecosystem

期刊

CANCER DISCOVERY
卷 11, 期 5, 页码 1082-1099

出版社

AMER ASSOC CANCER RESEARCH
DOI: 10.1158/2159-8290.CD-20-1230

关键词

-

类别

资金

  1. Microsoft AI for Good program
  2. DNAnexus
  3. St. Jude Blue Sky initiative
  4. National Cancer Institute of the National Institutes of Health [R01CA216391]

向作者/读者索取更多资源

Effective data sharing is crucial in accelerating research and improving diagnostic precision, treatment efficacy, and long-term survival rates for pediatric cancer. St. Jude Cloud provides a cloud-based ecosystem with over 1.2 petabytes of genomic data from over 10,000 pediatric patients, enabling advanced data analysis and knowledge enhancement in pediatric cancer research.
Effective data sharing is key to accelerating research to improve diagnostic precision, treatment efficacy, and long-term survival in pediatric cancer and other childhood catastrophic diseases. We present St. Jude Cloud (https://www.stjude.cloud), a cloud-based data-sharing ecosystem for accessing, analyzing, and visualizing genomic data from >10,000 pediatric patients with cancer and long-term survivors, and >800 pediatric sickle cell patients. Harmonized genomic data totaling 1.25 petabytes are freely available, including 12,104 whole genomes, 7,697 whole exomes, and 2,202 transcriptomes. The resource is expanding rapidly, with regular data uploads from St. Jude's prospective clinical genomics programs. Three interconnected apps within the ecosystem-Genomics Platform, Pediatric Cancer Knowledgebase, and Visualization Community-enable simultaneously performing advanced data analysis in the cloud and enhancing the Pediatric Cancer knowledgebase. We demonstrate the value of the ecosystem through use cases that classify 135 pediatric cancer subtypes by gene expression profiling and map mutational signatures across 35 pediatric cancer subtypes. SIGNIFICANCE: To advance research and treatment of pediatric cancer, we developed St. Jude Cloud, a data-sharing ecosystem for accessing >1.2 petabytes of raw genomic data from >10,000 pediatric patients and survivors, innovative analysis workflows, integrative multiomics visualizations, and a knowledgebase of published data contributed by the global pediatric cancer community.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据