4.6 Article

An Intelligent Framework for Oversubscription Management in CPU-GPU Unified Memory

期刊

JOURNAL OF GRID COMPUTING
卷 21, 期 1, 页码 -

出版社

SPRINGER
DOI: 10.1007/s10723-023-09646-1

关键词

Discrete CPU-GPU system; Unified virtual memory; Oversubscription; Deep learning

向作者/读者索取更多资源

This paper proposes a novel framework for UVM oversubscription management in discrete CPU-GPU systems, which significantly outperforms the state-of-the-art methods on memory-intensive benchmarks. It reduces page thrashing and achieves improved IPC under different levels of memory oversubscription.
Unified virtual memory (UVM) improves GPU programmability by enabling on-demand data movement between CPU memory and GPU memory. However, due to the limited capacity of GPU device memory, oversubscription overhead becomes a major performance bottleneck for data-intensive workloads running on GPUs with UVM. This paper proposes a novel framework for UVM oversubscription management in discrete CPU-GPU systems. It consists of an access pattern classifier followed by a pattern-specific transformer-based model using a novel loss function aiming to reduce page thrashing. A policy engine is designed to leverage the model's result to perform accurate page prefetching and eviction. Our evaluation shows that our proposed framework significantly outperforms the state-of-the-art (SOTA) methods on a set of 11 memory-intensive benchmarks, reducing the number of pages thrashed by 64.4% under 125% memory oversubscription compared to the baseline, while the SOTA method reduces the number of pages thrashed by 17.3%. Compared to the SOTA method, our solution achieves average IPC improvement of 1.52X and 3.66X under 125% and 150% memory oversubscription.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据