4.6 Article

Efficiency Near the Edge: Increasing the Energy Efficiency of FFTs on GPUs for Real-Time Edge Computing

Journal

IEEE ACCESS
Volume 9, Issue -, Pages 18167-18182

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2021.3053409

Keywords

Graphics processing units; Clocks; Hardware; Libraries; Power demand; Data processing; Real-time systems; Energy efficiency; high performance computing; real-time systems; parallel architectures; fast Fourier transforms

Funding

  1. Science and Technology Facilities Council (STFC), U.K. [ST/T000570/1]
  2. OP VVV MEYS funded project "Research Center for Informatics" [CZ.02.1.01/0.0/0.0/16_019/0000765]
  3. STFC [ST/T000570/1] Funding Source: UKRI

The SKA project aims to develop the world's largest radio telescope, requiring energy-efficient computational algorithms and in-situ data processing. Energy efficiency is a growing concern in modern computing, with hardware frequency scaling showing potential for significant power consumption reductions.
The Square Kilometer Array (SKA) is an international initiative for developing the world's largest radio telescope with a total collecting area of over a million square meters. The scale of the operation, combined with the remote location of the telescope, requires the use of energy-efficient computational algorithms. This, along with the extreme data rates that will be produced by the SKA and the requirement for a real-time observing capability, necessitates in-situ data processing in an edge-style computing solution. More generally, energy efficiency is becoming a paramount concern in the modern computing landscape, whether the constraint is the power budget that can limit some of the world's largest supercomputers or the limited power available to the smallest Internet-of-Things devices. In this article, we study the impact of hardware frequency scaling on the energy consumption and execution time of the Fast Fourier Transform (FFT) on NVIDIA GPUs using the cuFFT library. The FFT is used in many areas of science and is one of the key algorithms used in radio astronomy data processing pipelines. Through the use of frequency scaling, we show that we can lower the power consumption of the NVIDIA A100 GPU when computing the FFT by up to 47% compared to the boost clock frequency, with less than a 10% increase in the execution time. Furthermore, using one common core clock frequency for all tested FFT lengths, we show on average a 43% reduction in power consumption compared to the boost core clock frequency, with an increase in the execution time still below 10%. We demonstrate how these results can be used to lower the power consumption of existing data processing pipelines. Over years of operation, these reductions can yield significant financial savings and can also lead to a significant reduction in greenhouse gas emissions.
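
As a rough illustration of how the quoted power and execution-time figures combine, the sketch below estimates energy relative to the boost-clock baseline. It assumes that the reported reductions can be read as average power draw during the FFT and that energy is simply average power multiplied by execution time; neither assumption is stated in the abstract, and the function name `relative_energy` is hypothetical. Because the execution-time increase is reported as below 10%, using 10% gives an upper bound on the relative energy.

```python
def relative_energy(power_reduction: float, time_increase: float) -> float:
    """Energy relative to the baseline (1.0 = no change).

    Assumption (not from the source): energy = average power x execution time,
    so the ratio is (1 - power_reduction) * (1 + time_increase).
    Both arguments are fractions, e.g. 0.47 for 47%.
    """
    if not 0.0 <= power_reduction < 1.0:
        raise ValueError("power_reduction must be in [0, 1)")
    if time_increase < 0.0:
        raise ValueError("time_increase must be non-negative")
    return (1.0 - power_reduction) * (1.0 + time_increase)


# Figures quoted in the abstract; 10% is the stated upper limit on the slowdown.
cases = {
    "best case (47% lower power)": relative_energy(0.47, 0.10),
    "common clock (43% lower power)": relative_energy(0.43, 0.10),
}

for label, ratio in cases.items():
    # Report the bound on relative energy and the implied minimum saving.
    print(f"{label}: energy <= {ratio:.3f} of baseline, "
          f"saving >= {100.0 * (1.0 - ratio):.1f}%")
```

Under these assumptions, the slowdown only partly offsets the power reduction, so the estimated energy saving stays close to the power saving.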
