4.7 Article

SparkINFERNO: a scalable high-throughput pipeline for inferring molecular mechanisms of non-coding genetic variants

期刊

BIOINFORMATICS
卷 36, 期 12, 页码 3879-3881

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btaa246

关键词

-

资金

  1. National Institute on Aging [U24-AG041689, U54-AG052427, U01-AG032984, T32-AG00255]
  2. Biomarkers Across Neurodegenerative Diseases (BAND 3) [18062]
  3. Michael J Fox Foundation
  4. Alzheimer's Association
  5. Alzheimer's Research UK
  6. Weston Brain institute

向作者/读者索取更多资源

aSummary: We report Spark-based INFERence of the molecular mechanisms of NOn-coding genetic variants (SparkINFERNO), a scalable bioinformatics pipeline characterizing non-coding genome-wide association study (GWAS) association findings. SparkINFERNO prioritizes causal variants underlying GWAS association signals and reports relevant regulatory elements, tissue contexts and plausible target genes they affect. To achieve this, the SparkINFERNO algorithm integrates GWAS summary statistics with large-scale collection of functional genomics datasets spanning enhancer activity, transcription factor binding, expression quantitative trait loci and other functional datasets across more than 400 tissues and cell types. Scalability is achieved by an underlying API implemented using Apache Spark and Giggle-based genomic indexing. We evaluated SparkINFERNO on large GWASs and show that SparkINFERNO is more than 60 times efficient and scales with data size and amount of computational resources.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据