4.7 Article

Boosting signals in gene-based association studies via efficient SNP selection

Journal

BRIEFINGS IN BIOINFORMATICS
Volume 15, Issue 2, Pages 279-291

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbs087

Keywords

entropy; gene-centric association; mutual information; set-based association

Funding

  1. US National Science Foundation [DMS-1209112, IOS-1237969]
  2. Direct For Mathematical & Physical Scien
  3. Division Of Mathematical Sciences [1209112] Funding Source: National Science Foundation
  4. Division Of Integrative Organismal Systems
  5. Direct For Biological Sciences [1237969] Funding Source: National Science Foundation

Ask authors/readers for more resources

Set-based association studies based on genes or pathways have shown great promise in interpreting association signals associated with complex diseases. These approaches are particularly useful when variants in a set have moderate effects and are difficult to be detected with single marker analysis, especially when variants function jointly in a complicated manner. The set-based analyses use a summary statistic such as the maximum or average of individual signal (e.g. a chi-square statistic) over all variants in a set, or consider their joint distribution to assess the significance of the set. The signal obtained with this treatment, however, could be potentially diluted when noisy variants are not taken good care of, leading to either inflated false negatives or false positives. Thus, the selection of disease informative single-nucleotide polymorphism (diSNPs) plays a crucial role in improving the power of the set-based association study. In this work, we propose an efficient diSNP selection method based on the information theory. We select diSNP variants by considering their relative information contribution to a disease status, which is different from the usual tag SNP selection. The relative merit of pre-selecting diSNPs in a set-based association analysis is demonstrated through extensive simulation studies and real data analysis.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available