Journal
BRIEFINGS IN BIOINFORMATICS
Volume 15, Issue 2, Pages 279-291Publisher
OXFORD UNIV PRESS
DOI: 10.1093/bib/bbs087
Keywords
entropy; gene-centric association; mutual information; set-based association
Funding
- US National Science Foundation [DMS-1209112, IOS-1237969]
- Direct For Mathematical & Physical Scien
- Division Of Mathematical Sciences [1209112] Funding Source: National Science Foundation
- Division Of Integrative Organismal Systems
- Direct For Biological Sciences [1237969] Funding Source: National Science Foundation
Ask authors/readers for more resources
Set-based association studies based on genes or pathways have shown great promise in interpreting association signals associated with complex diseases. These approaches are particularly useful when variants in a set have moderate effects and are difficult to be detected with single marker analysis, especially when variants function jointly in a complicated manner. The set-based analyses use a summary statistic such as the maximum or average of individual signal (e.g. a chi-square statistic) over all variants in a set, or consider their joint distribution to assess the significance of the set. The signal obtained with this treatment, however, could be potentially diluted when noisy variants are not taken good care of, leading to either inflated false negatives or false positives. Thus, the selection of disease informative single-nucleotide polymorphism (diSNPs) plays a crucial role in improving the power of the set-based association study. In this work, we propose an efficient diSNP selection method based on the information theory. We select diSNP variants by considering their relative information contribution to a disease status, which is different from the usual tag SNP selection. The relative merit of pre-selecting diSNPs in a set-based association analysis is demonstrated through extensive simulation studies and real data analysis.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available