期刊
BIOINFORMATICS
卷 36, 期 12, 页码 3879-3881出版社
OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btaa246
关键词
-
类别
资金
- National Institute on Aging [U24-AG041689, U54-AG052427, U01-AG032984, T32-AG00255]
- Biomarkers Across Neurodegenerative Diseases (BAND 3) [18062]
- Michael J Fox Foundation
- Alzheimer's Association
- Alzheimer's Research UK
- Weston Brain institute
aSummary: We report Spark-based INFERence of the molecular mechanisms of NOn-coding genetic variants (SparkINFERNO), a scalable bioinformatics pipeline characterizing non-coding genome-wide association study (GWAS) association findings. SparkINFERNO prioritizes causal variants underlying GWAS association signals and reports relevant regulatory elements, tissue contexts and plausible target genes they affect. To achieve this, the SparkINFERNO algorithm integrates GWAS summary statistics with large-scale collection of functional genomics datasets spanning enhancer activity, transcription factor binding, expression quantitative trait loci and other functional datasets across more than 400 tissues and cell types. Scalability is achieved by an underlying API implemented using Apache Spark and Giggle-based genomic indexing. We evaluated SparkINFERNO on large GWASs and show that SparkINFERNO is more than 60 times efficient and scales with data size and amount of computational resources.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据