4.6 Article

Privacy Preserving Classification of EEG Data Using Machine Learning and Homomorphic Encryption

Journal

APPLIED SCIENCES-BASEL
Volume 11, Issue 16, Pages -

Publisher

MDPI
DOI: 10.3390/app11167360

Keywords

privacy-preserving computations; homomorphic encryption; machine learning; EEG signals

Funding

  1. Romanian Ministry of Education and Research, CCCDI-UEFISCDI within PNCDI III [PN-III-P2-2.1-PED-2019-2415]
  2. Ministry of Education and Research, CNCS/CCCDI-UEFISCDI [PN-III-P3-3.6-H2020-2020-0145/2021]
  3. European Union [875351]

Ask authors/readers for more resources

The study introduces an encoding method to enable homomorphic encryption schemes to operate on real-valued numbers of arbitrary precision and size in EEG signal scenarios. The computational time for training models increases but remains manageable, while the inference time remains in the order of milliseconds. The prediction performance of models operating on encoded and encrypted data is comparable to standard models operating on plaintext data.
Data privacy is a major concern when accessing and processing sensitive medical data. A promising approach among privacy-preserving techniques is homomorphic encryption (HE), which allows for computations to be performed on encrypted data. Currently, HE still faces practical limitations related to high computational complexity, noise accumulation, and sole applicability the at bit or small integer values level. We propose herein an encoding method that enables typical HE schemes to operate on real-valued numbers of arbitrary precision and size. The approach is evaluated on two real-world scenarios relying on EEG signals: seizure detection and prediction of predisposition to alcoholism. A supervised machine learning-based approach is formulated, and training is performed using a direct (non-iterative) fitting method that requires a fixed and deterministic number of steps. Experiments on synthetic data of varying size and complexity are performed to determine the impact on runtime and error accumulation. The computational time for training the models increases but remains manageable, while the inference time remains in the order of milliseconds. The prediction performance of the models operating on encoded and encrypted data is comparable to that of standard models operating on plaintext data.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available