4.4 Article

Disentangling genetic feature selection and aggregation in transcriptome-wide association studies

期刊

GENETICS
卷 220, 期 2, 页码 -

出版社

GENETICS SOCIETY AMERICA
DOI: 10.1093/genetics/iyab216

关键词

statistical genetics; transcriptome-wide association studies; feature selection; kernel machine; statistical power

资金

  1. Natural Sciences and Engineering Research Council of Canada (NSERC) [RGPIN2017-04860]
  2. Canada Foundation for Innovation JELF grant [36605]
  3. New Frontiers in Research Fund [NFRFE-2018-00748]
  4. HBI Pilot grant
  5. NSERC [RGPIN-2018-05147]
  6. University of Calgary VPR Catalyst grant
  7. Alberta Children's Hospital Research Institute (ACHRI) scholarship
  8. NSERC USRA award

向作者/读者索取更多资源

This study investigates the core component of genetically regulated expression (GReX) in transcriptome-wide association studies (TWAS). The research shows that separating feature selection and feature aggregation steps can improve the predictive accuracy of TWAS compared to standard protocols relying on GReX. Furthermore, the study highlights the importance of methodological research focusing on optimal combinations of feature selection and aggregation approaches to enhance TWAS protocols.
The success of transcriptome-wide association studies (TWAS) has led to substantial research toward improving the predictive accuracy of its core component of genetically regulated expression (GReX). GReX links expression information with genotype and phenotype by playing two roles simultaneously: it acts as both the outcome of the genotype-based predictive models (for predicting expressions) and the linear combination of genotypes (as the predicted expressions) for association tests. From the perspective of machine learning (considering SNPs as features), these are actually two separable steps-feature selection and feature aggregation-which can be independently conducted. In this study, we show that the single approach of GReX limits the adaptability of TWAS methodology and practice. By conducting simulations and real data analysis, we demonstrate that disentangled protocols adapting straightforward approaches for feature selection (e.g., simple marker test) and aggregation (e.g., kernel machines) outperform the standard TWAS protocols that rely on GReX. Our development provides more powerful novel tools for conducting TWAS. More importantly, our characterization of the exact nature of TWAS suggests that, instead of questionably binding two distinct steps into the same statistical form (GReX), methodological research focusing on optimal combinations of feature selection and aggregation approaches will bring higher power to TWAS protocols.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据