4.4 Article

Private Genomes and Public SNPs: Homomorphic Encryption of Genotypes and Phenotypes for Shared Quantitative Genetics

Journal

GENETICS
Volume 215, Issue 2, Pages 359-372

Publisher

GENETICS SOCIETY AMERICA
DOI: 10.1534/genetics.120.303153

Keywords

quantitative genetics; homomorphic encryption; genetic privacy

Funding

  1. Biotechnology and Biological Sciences Research Council [BB/S017372/1, BB/R01356X/1, BB/P024726/1, BB/M011585/1]
  2. University of Tennessee Center for Integrative and Translational Genomics
  3. University of Tennessee-Oak Ridge National Laboratory Governor's Chair
  4. National Institute of General Medical Sciences grant [R01 GM-123489]
  5. National Institute of Drug Abuse [P30 DA-044223]
  6. BBSRC [BB/S017372/1] Funding Source: UKRI

Ask authors/readers for more resources

Sharing human genotype and phenotype data is essential to discover otherwise inaccessible genetic associations, but is a challenge because of privacy concerns. Here, we present a method of homomorphic encryption that obscures individuals' genotypes and phenotypes, and is suited to quantitative genetic association analysis. Encrypted ciphertext and unencrypted plaintext are analytically interchangeable. The encryption uses a high-dimensional random linear orthogonal transformation key that leaves the likelihood of quantitative trait data unchanged under a linear model with normally distributed errors. It also preserves linkage disequilibrium between genetic variants and associations between variants and phenotypes. It scrambles relationships between individuals: encrypted genotype dosages closely resemble Gaussian deviates, and can be replaced by quantiles from a Gaussian with negligible effects on accuracy. Likelihood-based inferences are unaffected by orthogonal encryption. These include linear mixed models to control for unequal relatedness between individuals, heritability estimation, and including covariates when testing association. Orthogonal transformations can be applied in a modular fashion for multiparty federated mega-analyses where the parties first agree to share a common set of genotype sites and covariates prior to encryption. Each then privately encrypts and shares their own ciphertext, and analyses all parties' ciphertexts. In the absence of private variants, or knowledge of the key, we show that it is infeasible to decrypt ciphertext using existing brute-force or noise-reduction attacks. We present the method as a challenge to the community to determine its security. Mott et al. show that association between a quantitative trait and genotype can be performed using data that has been transformed by first rotating it in a high-dimensional space. The resulting...

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available