4.8 Article

A method to predict the impact of regulatory variants from DNA sequence

Journal

NATURE GENETICS
Volume 47, Issue 8, Pages 955-+

Publisher

NATURE PUBLISHING GROUP
DOI: 10.1038/ng.3331

Keywords

-

Funding

  1. US National Institutes of Health [R01 NS62972, R01 HG007348]
  2. NATIONAL HUMAN GENOME RESEARCH INSTITUTE [R01HG007348] Funding Source: NIH RePORTER
  3. NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES [K12GM068524, T32GM007814] Funding Source: NIH RePORTER
  4. NATIONAL INSTITUTE OF NEUROLOGICAL DISORDERS AND STROKE [R01NS062972] Funding Source: NIH RePORTER

Ask authors/readers for more resources

Most variants implicated in common human disease by genome-wide association studies (GWAS) lie in noncoding sequence intervals. Despite the suggestion that regulatory element disruption represents a common theme, identifying causal risk variants within implicated genomic regions remains a major challenge. Here we present a new sequence-based computational method to predict the effect of regulatory variation, using a classifier (gkm-SVM) that encodes cell type-specific regulatory sequence vocabularies. The induced change in the gkm-SVM score, deltaSVM, quantifies the effect of variants. We show that deltaSVM accurately predicts the impact of SNPs on DNase I sensitivity in their native genomic contexts and accurately predicts the results of dense mutagenesis of several enhancers in reporter assays. Previously validated GWAS SNPs yield large deltaSVM scores, and we predict new risk-conferring SNPs for several autoimmune diseases. Thus, deltaSVM provides a powerful computational approach to systematically identify functional regulatory variants.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available