4.4 Article

INTEGRATED QUANTILE RANK TEST (IQRAT) FOR GENE-LEVEL ASSOCIATIONSINTEGRATED QUANTILE RANK TEST (IQRAT) FOR GENE-LEVEL ASSOCIATIONS

期刊

ANNALS OF APPLIED STATISTICS
卷 16, 期 3, 页码 1423-1444

出版社

INST MATHEMATICAL STATISTICS-IMS
DOI: 10.1214/21-AOAS1548

关键词

Quantile process; rank score test; gene-set associations

资金

  1. National Institutes of Health [R01 HG008980, MH095797, AG072272]
  2. National Science Foundation [DMS-1953527]
  3. National Natural Science Foundation of China [12101351]

向作者/读者索取更多资源

Gene-based testing is commonly used in genetic association studies, but existing tests may miss complex associations. This paper proposes a new gene-level association test that integrates quantile rank score process for better accommodation. The proposed test statistics are efficient, provide insights into risk stratification, are distribution free, and computationally efficient. Simulation studies and real data applications show that the proposed approach outperforms conventional mean-based tests.
Gene-based testing is a commonly employed strategy in many genetic association studies. Gene-trait associations can be complex due to underlying population heterogeneity, gene-environment interactions, and various other reasons. Existing gene-based tests, such as burden and sequence kernel association tests (SKAT), are mean-based tests and may miss or underestimate higher-order associations that could be scientifically interesting. In this paper we propose a new family of gene-level association tests that integrate quantile rank score process to better accommodate complex associations. The resulting test statistics have multiple advantages: (1) they are almost as efficient as the best existing tests when the associations are homogeneous across quantile levels and have improved efficiency for complex and heterogeneous associations; (2) they provide useful insights into risk stratification; (3) the test statistics are distribution free and could hence accommodate a wide range of underlying distributions, and (4) they are computationally efficient. We established the asymptotic properties of the proposed tests under the null and alternative hypotheses and conducted large-scale simulation studies to investigate their finite sample performance. The performance of the proposed approach is compared with that of conventional mean-based tests, that is, the burden and SKAT tests, through simulation studies and applications to a metabochip dataset on lipid traits and to the genotype-tissue expression data in GTEx to identify eGenes, that is, genes whose expression levels are associated with cis-eQTLs.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据