4.7 Article

Exploring Data Analytics Without Decompression on Embedded GPU Systems

Journal

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TPDS.2021.3119402

Keywords

Graphics processing units; Embedded systems; Data analysis; Parallel processing; Instruction sets; Optimization; Random access memory; TADOC; embedded GPU systems; compression; data analytics

Funding

  1. National Key Research and Development Program of China [2018YFB1004401]
  2. National Natural Science Foundation of China [61732014, 62172419, U20A20226, 61802412]
  3. Beijing Natural Science Foundation [4202031]
  4. Tsinghua University-Peking Union Medical College Hospital Initiative Scientific Research Program [20191080594]
  5. State Key Laboratory of Computer Architecture (ICT, CAS) [CARCHA202007]
  6. GHfund A [20210701]
  7. CCF-Tencent Open Research Fund

Ask authors/readers for more resources

This paper proposes a novel data analytics method, G-TADOC, for efficient text analytics directly on compression on embedded GPU systems. G-TADOC has three innovations and involves special optimizations for embedded GPUs. Experimental results show that G-TADOC provides higher speedup, performance-per-cost, and energy efficiency compared to the state-of-the-art TADOC.
With the development of computer architecture, even for embedded systems, GPU devices can be integrated, providing outstanding performance and energy efficiency to meet the requirements of different industries, applications, and deployment environments. Data analytics is an important application scenario for embedded systems. Unfortunately, due to the limitation of the capacity of the embedded device, the scale of problems handled by the embedded system is limited. In this paper, we propose a novel data analytics method, called G-TADOC, for efficient text analytics directly on compression on embedded GPU systems. A large amount of data can be compressed and stored in embedded systems, and can be processed directly in the compressed state, which greatly enhances the processing capabilities of the systems. Particularly, G-TADOC has three innovations. First, a novel fine-grained thread-level workload scheduling strategy for GPU threads has been developed, which partitions heavily-dependent loads adaptively in a fine-grained manner. Second, a GPU thread-safe memory pool has been developed to handle inconsistency with low synchronization overheads. Third, a sequence-support strategy is provided to maintain high GPU parallelism while ensuring sequence information for lossless compression. Moreover, G-TADOC involves special optimizations for embedded GPUs, such as utilizing the CPU-GPU shared unified memory. Experiments show that G-TADOC provides 13.2x average speedup compared to the state-of-the-art TADOC. G-TADOC also improves performance-per-cost by 2.6x and energy efficiency by 32.5x over TADOC.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available