☆ 4.7 Article

Exploring Data Analytics Without Decompression on Embedded GPU Systems

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS (2022)

Journal

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS

Volume 33, Issue 7, Pages 1553-1568

Publisher

IEEE COMPUTER SOC

DOI: 10.1109/TPDS.2021.3119402

Keywords

Graphics processing units; Embedded systems; Data analysis; Parallel processing; Instruction sets; Optimization; Random access memory; TADOC; embedded GPU systems; compression; data analytics

Funding

National Key Research and Development Program of China [2018YFB1004401]
National Natural Science Foundation of China [61732014, 62172419, U20A20226, 61802412]
Beijing Natural Science Foundation [4202031]
Tsinghua University-Peking Union Medical College Hospital Initiative Scientific Research Program [20191080594]
State Key Laboratory of Computer Architecture (ICT, CAS) [CARCHA202007]
GHfund A [20210701]
CCF-Tencent Open Research Fund

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

This paper proposes a novel data analytics method, G-TADOC, for efficient text analytics directly on compression on embedded GPU systems. G-TADOC has three innovations and involves special optimizations for embedded GPUs. Experimental results show that G-TADOC provides higher speedup, performance-per-cost, and energy efficiency compared to the state-of-the-art TADOC.

With the development of computer architecture, even for embedded systems, GPU devices can be integrated, providing outstanding performance and energy efficiency to meet the requirements of different industries, applications, and deployment environments. Data analytics is an important application scenario for embedded systems. Unfortunately, due to the limitation of the capacity of the embedded device, the scale of problems handled by the embedded system is limited. In this paper, we propose a novel data analytics method, called G-TADOC, for efficient text analytics directly on compression on embedded GPU systems. A large amount of data can be compressed and stored in embedded systems, and can be processed directly in the compressed state, which greatly enhances the processing capabilities of the systems. Particularly, G-TADOC has three innovations. First, a novel fine-grained thread-level workload scheduling strategy for GPU threads has been developed, which partitions heavily-dependent loads adaptively in a fine-grained manner. Second, a GPU thread-safe memory pool has been developed to handle inconsistency with low synchronization overheads. Third, a sequence-support strategy is provided to maintain high GPU parallelism while ensuring sequence information for lossless compression. Moreover, G-TADOC involves special optimizations for embedded GPUs, such as utilizing the CPU-GPU shared unified memory. Experiments show that G-TADOC provides 13.2x average speedup compared to the state-of-the-art TADOC. G-TADOC also improves performance-per-cost by 2.6x and energy efficiency by 32.5x over TADOC.

Exploring Data Analytics Without Decompression on Embedded GPU Systems

Journal

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS

Publisher

IEEE COMPUTER SOC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Exploring Data Analytics Without Decompression on Embedded GPU Systems

Journal

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS

Publisher

IEEE COMPUTER SOC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper