4.7 Article

Cloud Computing-Based TagSNP Selection Algorithm for Human Genome Data

期刊

INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES
卷 16, 期 1, 页码 1096-1110

出版社

MDPI AG
DOI: 10.3390/ijms16011096

关键词

SNPs; haplotype; cloud computing; parallel processing; MapReduce

资金

  1. Ministry of Science and Technology, Taiwan, R.O.C [MOST 103-2632-E-126-001-MY3]

向作者/读者索取更多资源

Single nucleotide polymorphisms (SNPs) play a fundamental role in human genetic variation and are used in medical diagnostics, phylogeny construction, and drug design. They provide the highest-resolution genetic fingerprint for identifying disease associations and human features. Haplotypes are regions of linked genetic variants that are closely spaced on the genome and tend to be inherited together. Genetics research has revealed SNPs within certain haplotype blocks that introduce few distinct common haplotypes into most of the population. Haplotype block structures are used in association-based methods to map disease genes. In this paper, we propose an efficient algorithm for identifying haplotype blocks in the genome. In chromosomal haplotype data retrieved from the HapMap project website, the proposed algorithm identified longer haplotype blocks than an existing algorithm. To enhance its performance, we extended the proposed algorithm into a parallel algorithm that copies data in parallel via the Hadoop MapReduce framework. The proposed MapReduce-paralleled combinatorial algorithm performed well on real-world data obtained from the HapMap dataset; the improvement in computational efficiency was proportional to the number of processors used.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据