4.6 Article

Accelerating Fully Homomorphic Encryption Through Architecture-Centric Analysis and Optimization

Journal

IEEE ACCESS
Volume 9, Issue -, Pages 98772-98789

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2021.3096189

Keywords

Computer applications; computer architecture; cryptography; multicore processing

Funding

  1. Samsung Electronics Company Ltd. [IO201207-07812-01]
  2. National Research Foundation of Korea (NRF) Grant through the Korean Government (MSIT) [2020R1A2C2010601]
  3. Institute of Information and communications Technology Planning and evaluation (IITP) grants through the Korean Government by the Articial Intelligence Graduate School Program (Seoul National University) [2021-0-01343]
  4. [2020-0-00840]
  5. National Research Foundation of Korea [2020R1A2C2010601] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

Ask authors/readers for more resources

Homomorphic Encryption (HE) is a popular privacy-preserving approach for cloud computing, with schemes like HE for Arithmetic of Approximate Numbers (HEAAN) gaining popularity due to their support for approximate computations and unlimited arithmetic operations. However, the high computation complexity of HE, especially in ciphertext arithmetic like HE multiplication (HE Mul), has led to a lack of rigorous analysis in accelerating HE and optimizing performance for different parallel processing platforms.
Homomorphic Encryption (HE) has drawn significant attention as a privacy-preserving approach for cloud computing because it allows computation on encrypted messages called ciphertexts. Among the numerous HE schemes proposed thus far, HE for Arithmetic of Approximate Numbers (HEAAN) is rapidly gaining in popularity across a wide range of applications, as it supports messages that can tolerate approximate computations with no limit on the number of arithmetic operations applicable to the ciphertexts. A critical shortcoming of HE is the high computation complexity of ciphertext arithmetic; specifically, HE multiplication (HE Mul) is more than 10,000 times slower than the corresponding multiplication between unencrypted messages. This has led to a large body of HE acceleration studies, including those that exploit FPGAs; however, a rigorous analysis of the computational complexity and data access patterns of HE Mul is lacking. Moreover, the proposals mostly focused on designs with small parameter sizes, making it difficult accurately to estimate the performance of the HE accelerators when conducting a series of complex arithmetic operations. In this paper, we first describe how HE Mul of HEAAN is performed in a manner friendly to non-crypto experts. Then, we conduct a disciplined analysis of its computational and memory-access characteristics, through which we (1) extract parallelism in the key functions composing HE Mul and (2) demonstrate how to map the parallelism effectively to popular parallel processing platforms, CPUs and GPUs, by applying a series of optimizations such as transposing matrices and pinning data to threads. This leads to performance improvements of HE Mul on a CPU and a GPU by 2.06x and 4.05x, respectively, over the reference HEAAN running on a CPU with 24 threads.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available