4.7 Article

A novel privacy-preserving federated genome-wide association study framework and its application in identifying potential risk variants in ankylosing spondylitis

期刊

BRIEFINGS IN BIOINFORMATICS
卷 22, 期 3, 页码 -

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbaa090

关键词

federated learning; privacy protection; GWAS; ankylosing spondylitis

资金

  1. National Natural Science Foundation of China [31770988, 31821003]
  2. China Ministry of Science and Technology [2018AAA0100300]
  3. Shanghai Municipal Key Clinical Specialty Fund [shslczdzk02602]
  4. Key Lab of Information Network Security of Ministry of Public Security (the Third Research Institute of Ministry of Public Security) [C19609]

向作者/读者索取更多资源

Genome-wide association studies (GWAS) are used to identify potential risk variants in diseases, but typically require cross-institutional partnerships to overcome sample size limitations and increase statistical power. However, data privacy remains a major challenge in such partnerships.
Genome-wide association studies (GWAS) have been widely used for identifying potential risk variants in various diseases. A statistically meaningful GWAS typically requires a large sample size to detect disease-associated single nucleotide polymorphisms (SNPs). However, a single institution usually only possesses a limited number of samples. Therefore, cross-institutional partnerships are required to increase sample size and statistical power. However, cross-institutional partnerships offer significant challenges, a major one being data privacy. For example, the privacy awareness of people, the impact of data privacy leakages and the privacy-related risks are becoming increasingly important, while there is no de-identification standard available to safeguard genomic data sharing. In this paper, we introduce a novel privacy-preserving federated GWAS framework (iPRIVATES). Equipped with privacy-preserving federated analysis, iPRIVATES enables multiple institutions to jointly perform GWAS analysis without leaking patient-level genotyping data. Only aggregated local statistics are exchanged within the study network. In addition, we evaluate the performance of iPRIVATES through both simulated data and a real-world application for identifying potential risk variants in ankylosing spondylitis (AS). The experimental results showed that the strongest signal of AS-associated SNPs reside mostly around the human leukocyte antigen (HLA) regions. The proposed iPRIVATES framework achieved equivalent results as traditional centralized implementation, demonstrating its great potential in driving collaborative genomic research for different diseases while preserving data privacy.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据