4.4 Article

GAMUT: A genomics big data management tool

期刊

JOURNAL OF BIOSCIENCES
卷 46, 期 4, 页码 -

出版社

INDIAN ACAD SCIENCES
DOI: 10.1007/s12038-021-00213-y

关键词

Big data; database; genomics; NGS; SNP; variant comparison; vcf

类别

资金

  1. BBSRC
  2. BBSRC [BB/K021362/1] Funding Source: UKRI

向作者/读者索取更多资源

GAMUT is a big data-based tool for efficient comparison of SNPs in different population samples, supporting dynamic querying and various charting options, with the ability to download data results for further analysis.
Efficient analysis of Single Nucleotide Polymorphisms (SNPs) across genomic samples enable in deciphering the relationship between genotype and phenotype. The core principle behind SNP comparison is to arrive at a probable list of variants that can differentiate two sets of data (populations). Such SNPs have direct applications in array design, genotype imputation and in cataloging of variants in regions of interest. We have developed GAMUT (Genomics bigdAta Management Tool), a big data-based solution for efficient run-time comparison of SNPs across large datasets based on partition of samples belonging to different populations taking into account user-defined splits. The tool is based on client-server architecture with MongoDB at the back-end and JSF with PrimeFaces as the front-end. It is readily deployable on wild-fly server as well as a docker container. Spark-based parallel data uploader enables optimal loading times. GAMUT enables dynamic querying of the large datasets consisting of multiple samples using text-based, chromosome position-based as well as gene-name based options. Various charting options like bar and pie charts along with tabular formats are available to ease the analysis of the queried data. The resultant data pertaining to comparison of genome-wide SNPs can also be downloaded in different formats like text, html, json for further stand-alone analysis.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据